Tune-a-video: One-shot tuning of image diffusion models for text-to-video generation JZ Wu, Y Ge, X Wang, SW Lei, Y Gu, Y Shi, W Hsu, Y Shan, X Qie, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 603 | 2023 |
Occluded prohibited items detection: An x-ray security inspection benchmark and de-occlusion attention module Y Wei, R Tao, Z Wu, Y Ma, L Zhang, X Liu Proceedings of the 28th ACM International Conference on Multimedia, 138-146, 2020 | 204 | 2020 |
Show-1: Marrying pixel and latent diffusion models for text-to-video generation DJ Zhang*, JZ Wu*, JW Liu*, R Zhao, L Ran, Y Gu, D Gao, MZ Shou International Journal of Computer Vision, 1-15, 2024 | 124 | 2024 |
Mix-of-show: Decentralized low-rank adaptation for multi-concept customization of diffusion models Y Gu, X Wang, JZ Wu, Y Shi, Y Chen, Z Fan, W Xiao, R Zhao, S Chang, ... Advances in Neural Information Processing Systems 36, 2024 | 108 | 2024 |
Mining whole-lung information by artificial intelligence for predicting EGFR genotype and targeted therapy response in lung cancer: a multicohort study S Wang, H Yu, Y Gan, Z Wu, E Li, X Li, J Cao, Y Zhu, L Wang, H Deng, ... The Lancet Digital Health 4 (5), e309-e319, 2022 | 96 | 2022 |
Motiondirector: Motion customization of text-to-video diffusion models R Zhao, Y Gu, JZ Wu, DJ Zhang, JW Liu, W Wu, J Keppo, MZ Shou European Conference on Computer Vision, 273-290, 2025 | 54 | 2025 |
Symbolic replay: Scene graph as prompt for continual learning on vqa task SW Lei, D Gao, JZ Wu, Y Wang, W Liu, M Zhang, MZ Shou Proceedings of the AAAI Conference on Artificial Intelligence 37 (1), 1250-1259, 2023 | 28 | 2023 |
CVPR 2023 Text Guided Video Editing Competition JZ Wu, X Li, D Gao, Z Dong, J Bai, A Singh, X Xiang, Y Li, Z Huang, Y Sun, ... arXiv preprint arXiv:2310.16003, 2023 | 27 | 2023 |
Towards A Better Metric for Text-to-Video Generation JZ Wu, G Fang, H Wu, X Wang, Y Ge, X Cun, DJ Zhang, JW Liu, Y Gu, ... arXiv preprint arXiv:2401.07781, 2024 | 20 | 2024 |
A novel deep learning framework based mask-guided attention mechanism for distant metastasis prediction of lung cancer Z Li, S Wang, H Yu, Y Zhu, Q Wu, L Wang, Z Wu, Y Gan, W Li, B Qiu, ... IEEE Transactions on Emerging Topics in Computational Intelligence 7 (2 …, 2022 | 17 | 2022 |
Videoswap: Customized video subject swapping with interactive semantic point correspondence Y Gu, Y Zhou, B Wu, L Yu, JW Liu, R Zhao, JZ Wu, DJ Zhang, MZ Shou, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 16 | 2024 |
Label-efficient online continual object detection in streaming video JZ Wu, DJ Zhang, W Hsu, M Zhang, MZ Shou Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 10 | 2023 |
Dynvideo-e: Harnessing dynamic nerf for large-scale motion-and view-change human-centric video editing JW Liu, YP Cao, JZ Wu, W Mao, Y Gu, R Zhao, J Keppo, Y Shan, MZ Shou Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 8 | 2024 |
Free-ATM: Harnessing Free Attention Masks for Representation Learning on Diffusion-Generated Images DJ Zhang, M Xu, JZ Wu, C Xue, W Zhang, X Han, S Bai, MZ Shou European Conference on Computer Vision, 465-482, 2025 | 6* | 2025 |
Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives T Nguyen, Y Bin, J Xiao, L Qu, Y Li, JZ Wu, CD Nguyen, SK Ng, LA Tuan arXiv preprint arXiv:2406.05615, 2024 | 4 | 2024 |
SCube: Instant Large-Scale Scene Reconstruction using VoxSplats X Ren, Y Lu, H Liang, Z Wu, H Ling, M Chen, S Fidler, F Williams, ... arXiv preprint arXiv:2410.20030, 2024 | | 2024 |
EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models R Zhao, H Yuan, Y Wei, S Zhang, Y Gu, L Ran, X Wang, Z Wu, J Zhang, ... arXiv preprint arXiv:2410.07133, 2024 | | 2024 |