Cong Wei
Verified email at uwaterloo.ca
Title
Cited by
Year
MMMU: A massive multi-discipline multimodal understanding and reasoning benchmark for expert AGI
X Yue, Y Ni, K Zhang, T Zheng, R Liu, G Zhang, S Stevens, D Jiang, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by: 367 | Year: 2024
Mantis: Interleaved multi-image instruction tuning
D Jiang, X He, H Zeng, C Wei, M Ku, Q Liu, W Chen
arXiv preprint arXiv:2405.01483, 2024
Cited by: 47 | Year: 2024
ConsistI2V: Enhancing visual consistency for image-to-video generation
W Ren, H Yang, G Zhang, C Wei, X Du, W Huang, W Chen
arXiv preprint arXiv:2402.04324, 2024
Cited by: 29 | Year: 2024
VIEScore: Towards explainable metrics for conditional image synthesis evaluation
M Ku, D Jiang, C Wei, X Yue, W Chen
arXiv preprint arXiv:2312.14867, 2023
Cited by: 24 | Year: 2023
UniIR: Training and benchmarking universal multimodal information retrievers
C Wei, Y Chen, H Chen, H Hu, G Zhang, J Fu, A Ritter, W Chen
arXiv preprint arXiv:2311.17136, 2023
Cited by: 23 | Year: 2023
DreamEdit: Subject-driven image editing
T Li, M Ku, C Wei, W Chen
arXiv preprint arXiv:2306.12624, 2023
Cited by: 22 | Year: 2023
AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks
M Ku, C Wei, W Ren, H Yang, W Chen
arXiv preprint arXiv:2403.14468, 2024
Cited by: 15* | Year: 2024
Sparsifiner: Learning sparse instance-dependent attention for efficient vision transformers
C Wei, B Duke, R Jiang, P Aarabi, GW Taylor, F Shkurti
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 12 | Year: 2023
OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision
C Wei, Z Xiong, W Ren, X Du, G Zhang, W Chen
arXiv preprint arXiv:2411.07199, 2024
Year: 2024