دنبال کردن
Zhangjie Wu
عنوان
نقل شده توسط
نقل شده توسط
سال
Tune-a-video: One-shot tuning of image diffusion models for text-to-video generation
JZ Wu, Y Ge, X Wang, SW Lei, Y Gu, Y Shi, W Hsu, Y Shan, X Qie, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
6032023
Occluded prohibited items detection: An x-ray security inspection benchmark and de-occlusion attention module
Y Wei, R Tao, Z Wu, Y Ma, L Zhang, X Liu
Proceedings of the 28th ACM International Conference on Multimedia, 138-146, 2020
2042020
Show-1: Marrying pixel and latent diffusion models for text-to-video generation
DJ Zhang*, JZ Wu*, JW Liu*, R Zhao, L Ran, Y Gu, D Gao, MZ Shou
International Journal of Computer Vision, 1-15, 2024
1242024
Mix-of-show: Decentralized low-rank adaptation for multi-concept customization of diffusion models
Y Gu, X Wang, JZ Wu, Y Shi, Y Chen, Z Fan, W Xiao, R Zhao, S Chang, ...
Advances in Neural Information Processing Systems 36, 2024
1082024
Mining whole-lung information by artificial intelligence for predicting EGFR genotype and targeted therapy response in lung cancer: a multicohort study
S Wang, H Yu, Y Gan, Z Wu, E Li, X Li, J Cao, Y Zhu, L Wang, H Deng, ...
The Lancet Digital Health 4 (5), e309-e319, 2022
962022
Motiondirector: Motion customization of text-to-video diffusion models
R Zhao, Y Gu, JZ Wu, DJ Zhang, JW Liu, W Wu, J Keppo, MZ Shou
European Conference on Computer Vision, 273-290, 2025
542025
Symbolic replay: Scene graph as prompt for continual learning on vqa task
SW Lei, D Gao, JZ Wu, Y Wang, W Liu, M Zhang, MZ Shou
Proceedings of the AAAI Conference on Artificial Intelligence 37 (1), 1250-1259, 2023
282023
CVPR 2023 Text Guided Video Editing Competition
JZ Wu, X Li, D Gao, Z Dong, J Bai, A Singh, X Xiang, Y Li, Z Huang, Y Sun, ...
arXiv preprint arXiv:2310.16003, 2023
272023
Towards A Better Metric for Text-to-Video Generation
JZ Wu, G Fang, H Wu, X Wang, Y Ge, X Cun, DJ Zhang, JW Liu, Y Gu, ...
arXiv preprint arXiv:2401.07781, 2024
202024
A novel deep learning framework based mask-guided attention mechanism for distant metastasis prediction of lung cancer
Z Li, S Wang, H Yu, Y Zhu, Q Wu, L Wang, Z Wu, Y Gan, W Li, B Qiu, ...
IEEE Transactions on Emerging Topics in Computational Intelligence 7 (2 …, 2022
172022
Videoswap: Customized video subject swapping with interactive semantic point correspondence
Y Gu, Y Zhou, B Wu, L Yu, JW Liu, R Zhao, JZ Wu, DJ Zhang, MZ Shou, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
162024
Label-efficient online continual object detection in streaming video
JZ Wu, DJ Zhang, W Hsu, M Zhang, MZ Shou
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
102023
Dynvideo-e: Harnessing dynamic nerf for large-scale motion-and view-change human-centric video editing
JW Liu, YP Cao, JZ Wu, W Mao, Y Gu, R Zhao, J Keppo, Y Shan, MZ Shou
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
82024
Free-ATM: Harnessing Free Attention Masks for Representation Learning on Diffusion-Generated Images
DJ Zhang, M Xu, JZ Wu, C Xue, W Zhang, X Han, S Bai, MZ Shou
European Conference on Computer Vision, 465-482, 2025
6*2025
Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives
T Nguyen, Y Bin, J Xiao, L Qu, Y Li, JZ Wu, CD Nguyen, SK Ng, LA Tuan
arXiv preprint arXiv:2406.05615, 2024
42024
SCube: Instant Large-Scale Scene Reconstruction using VoxSplats
X Ren, Y Lu, H Liang, Z Wu, H Ling, M Chen, S Fidler, F Williams, ...
arXiv preprint arXiv:2410.20030, 2024
2024
EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models
R Zhao, H Yuan, Y Wei, S Zhang, Y Gu, L Ran, X Wang, Z Wu, J Zhang, ...
arXiv preprint arXiv:2410.07133, 2024
2024
سیستم در حال حاضر قادر به انجام عملکرد نیست. بعداً دوباره امتحان کنید.
مقاله‌ها 1–17