フォロー
Wei Ji (吉炜)
Wei Ji (吉炜)
Nanjing University | National University of Singapore | Zhejiang University
確認したメール アドレス: nju.edu.cn - ホームページ
タイトル
引用先
引用先
Next-gpt: Any-to-any multimodal llm
S Wu, H Fei, L Qu, W Ji, TS Chua
ICML 2024, 2023
5402023
Deconfounded video moment retrieval with causal intervention
X Yang, F Feng, W Ji, M Wang, TS Chua
Proceedings of the 44th international ACM SIGIR conference on research and …, 2021
1972021
Boundary proposal network for two-stage natural language video localization
S Xiao, L Chen, S Zhang, W Ji, J Shao, L Ye, J Xiao
Proceedings of the AAAI Conference on Artificial Intelligence 35 (4), 2986-2994, 2021
1792021
Invariant grounding for video question answering
Y Li, X Wang, J Xiao, W Ji, TS Chua
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
1352022
Video as Conditional Graph Hierarchy for Multi-Granular Question Answering
J Xiao, A Yao, Z Liu, Y Li, W Ji, TS Chua
Proceedings of the AAAI Conference on Artificial Intelligence, 2022
1332022
Deep Learning for Weakly-Supervised Object Detection and Object Localization: A Survey
F Shao, L Chen, J Shao, W Ji, S Xiao, L Ye, Y Zhuang, J Xiao
Neurocomputing, 2021
106*2021
Video Question Answering: Datasets, Algorithms and Challenges
Y Zhong, W Ji, J Xiao, Y Li, W Deng, TS Chua
EMNLP 2022, 2022
1022022
Fine-Grained Scene Graph Generation with Data Transfer
A Zhang, Y Yao, Q Chen, W Ji, Z Liu, M Sun, TS Chua
ECCV, 2022
982022
Fine-tuning multimodal llms to follow zero-shot demonstrative instructions
J Li, K Pan, Z Ge, M Gao, W Ji, W Zhang, TS Chua, S Tang, H Zhang, ...
arXiv preprint arXiv:2308.04152, 2023
772023
Video-of-thought: Step-by-step video reasoning from perception to cognition
H Fei, S Wu, W Ji, H Zhang, M Zhang, ML Lee, W Hsu
arXiv preprint arXiv:2501.03230, 2024
762024
Transfer visual prompt generator across llms
A Zhang, H Fei, Y Yao, W Ji, L Li, Z Liu, TS Chua
NeurIPS, 2023
64*2023
Cross2StrA: Unpaired Cross-lingual Image Captioning with Cross-lingual Cross-modal Structure-pivoted Alignment
S Wu, H Fei, W Ji, TS Chua
ACL 2023, 2023
622023
Next-chat: An lmm for chat, detection and segmentation
A Zhang, L Zhao, CW Xie, Y Zheng, W Ji, TS Chua
ICML 2024, 2023
582023
Generating Visual Spatial Description via Holistic 3D Scene Understanding
Y Zhao, H Fei, W Ji, J Wei, M Zhang, M Zhang, TS Chua
ACL 2023, 2023
582023
FakeSV: A Multimodal Benchmark with Rich Social Context for Fake News Detection on Short Video Platforms
P Qi, Y Bu, J Cao, W Ji, R Shui, J Xiao, D Wang, TS Chua
AAAI 2023, 2022
582022
Dysen-vdm: Empowering dynamics-aware text-to-video diffusion with llms
H Fei, S Wu, W Ji, H Zhang, TS Chua
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
492024
PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models
Y Yao, Q Chen, A Zhang, W Ji, Z Liu, TS Chua, M Sun
EMNLP 2022, 2022
482022
Composed image retrieval with text feedback via multi-grained uncertainty regularization
Y Chen, Z Zheng, W Ji, L Qu, TS Chua
ICLR 2024, 2022
472022
Video visual relation detection via iterative inference
X Shang, Y Li, J Xiao, W Ji, TS Chua
Proceedings of the 29th ACM international conference on Multimedia, 3654-3663, 2021
422021
Content-Variant Reference Image Quality Assessment via Knowledge Distillation
G Yin, W Wang, Z Yuan, C Han, W Ji, S Sun, C Wang
Proceedings of the AAAI Conference on Artificial Intelligence, 2022
402022
現在システムで処理を実行できません。しばらくしてからもう一度お試しください。
論文 1–20