STAT: Spatial-temporal attention mechanism for video captioning C Yan*, Y Tu*, X Wang, Y Zhang, X Hao, Y Zhang, Q Dai IEEE TMM 2019, 2019 | 410 | 2019 |
Long short-term relation transformer with global gating for video captioning L Li, X Gao, J Deng, Y Tu, ZJ Zha, Q Huang IEEE TIP 2022, 2022 | 79 | 2022 |
Video description with spatial-temporal attention Y Tu, X Zhang, B Liu, C Yan ACM MM 2017, 2017 | 70 | 2017 |
Enhancing the alignment between target words and corresponding frames for video captioning Y Tu, C Zhou, J Guo, S Gao, Z Yu Pattern Recognition 2021, 2021 | 56 | 2021 |
SMART: Syntax-Calibrated Multi-Aspect Relation Transformer for Change Captioning Y Tu, L Li, L Su, ZJ Zha, Q Huang IEEE TPAMI 2024, 2024 | 28 | 2024 |
Semantic relation-aware difference representation learning for change captioning Y Tu, T Yao, L Li, J Lou, S Gao, Z Yu, C Yan Findings of ACL 2021, 2021 | 28 | 2021 |
Viewpoint-Adaptive Representation Disentanglement Network for Change Captioning Y Tu, L Li, L Su, J Du, K Lu, Q Huang IEEE TIP 2023, 2023 | 27* | 2023 |
Relation-aware attention for video captioning via graph learning Y Tu, C Zhou, J Guo, H Li, S Gao, Z Yu Pattern Recognition 2023, 2023 | 25 | 2023 |
I2Transformer: Intra-and Inter-relation Embedding Transformer for TV Show Captioning Y Tu, L Li, L Su, S Gao, C Yan, ZJ Zha, Z Yu, Q Huang IEEE TIP 2022, 2022 | 25 | 2022 |
Self-supervised cross-view representation reconstruction for change captioning Y Tu, L Li, L Su, ZJ Zha, C Yan, Q Huang ICCV 2023, 2023 | 23 | 2023 |
R Net: Relation-embedded Representation Reconstruction Network for Change Captioning Y Tu, L Li, C Yan, S Gao, Z Yu EMNLP 2021, 2021 | 19 | 2021 |
I3n: Intra-and inter-representation interaction network for change captioning S Yue, Y Tu, L Li, Y Yang, S Gao, Z Yu IEEE TMM 2023, 2023 | 18 | 2023 |
Neighborhood contrastive transformer for change captioning Y Tu, L Li, L Su, K Lu, Q Huang IEEE TMM 2023, 2023 | 17 | 2023 |
Ls-gan: iterative language-based image manipulation via long and short term consistency reasoning G Cong, L Li, Z Liu, Y Tu, W Qin, S Zhang, C Yan, W Wang, B Jiang ACM MM 2022, 2022 | 15 | 2022 |
Corrections to" STAT: Spatial-Temporal Attention Mechanism for Video Captioning". C Yan*, Y Tu*, X Wang, Y Zhang, X Hao, Y Zhang, Q Dai IEEE TMM 2020, 2020 | 7* | 2020 |
Context-aware Difference Distilling for Multi-change Captioning Y Tu, L Li, L Su, ZJ Zha, C Yan, Q Huang ACL 2024, 2024 | 4 | 2024 |
Multi-grained Representation Aggregating Transformer with Gating Cycle for Change Captioning S Yue, Y Tu, L Li, S Gao, Z Yu ACM TOMM 2024, 2024 | 3 | 2024 |
Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning Y Tu, L Li, L Su, C Yan, Q Huang ECCV 2024, 2024 | 1 | 2024 |
MAGIC: Rethinking Dynamic Convolution Design for Medical Image Segmentation S Li, Y Tu, Q Xiang, Z Li ACM MM 2024, 2024 | 1 | 2024 |
Query-centric Audio-Visual Cognition Network for Moment Retrieval, Segmentation and Step-Captioning Y Tu, L Li, L Su, Q Huang AAAI 2025, 2025 | | 2025 |