Vector quantized diffusion model for text-to-image synthesis S Gu, D Chen, J Bao, F Wen, B Zhang, D Chen, L Yuan, B Guo Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 881 | 2022 |
Paint by example: Exemplar-based image editing with diffusion models B Yang, S Gu, B Zhang, T Zhang, X Chen, X Sun, D Chen, F Wen Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 389 | 2023 |
Rodin: A generative model for sculpting 3d digital avatars using diffusion T Wang, B Zhang, T Zhang, S Gu, J Bao, T Baltrusaitis, J Shen, D Chen, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 342 | 2023 |
Styleswin: Transformer-based gan for high-resolution image generation B Zhang, S Gu, B Zhang, J Bao, D Chen, F Wen, Y Wang, B Guo Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 301 | 2022 |
Arbitrary style transfer with deep feature reshuffle S Gu, C Chen, J Liao, L Yuan Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 211 | 2018 |
Mask-guided portrait editing with conditional gans S Gu, J Bao, H Yang, D Chen, F Wen, L Yuan Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 160 | 2019 |
Efficient diffusion training via min-snr weighting strategy T Hang, S Gu, C Li, J Bao, D Chen, H Hu, X Geng, B Guo Proceedings of the IEEE/CVF international conference on computer vision …, 2023 | 137 | 2023 |
High-fidelity and arbitrary face editing Y Gao, F Wei, J Bao, S Gu, D Chen, F Wen, Z Lian Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 99 | 2021 |
Giqa: Generated image quality assessment S Gu, J Bao, D Chen, F Wen Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 98 | 2020 |
Instructdiffusion: A generalist modeling interface for vision tasks Z Geng, B Yang, T Hang, C Li, S Gu, T Zhang, J Bao, Z Zhang, H Li, H Hu, ... Proceedings of the IEEE/CVF Conference on computer vision and pattern …, 2024 | 94 | 2024 |
Improved vector quantized diffusion models Z Tang, S Gu, J Bao, D Chen, F Wen arXiv preprint arXiv:2205.16007, 2022 | 68 | 2022 |
Clip itself is a strong fine-tuner: Achieving 85.7% and 88.0% top-1 accuracy with vit-b and vit-l on imagenet X Dong, J Bao, T Zhang, D Chen, S Gu, W Zhang, L Yuan, D Chen, F Wen, ... arXiv preprint arXiv:2212.06138, 2022 | 35 | 2022 |
Step-aware preference optimization: Aligning preference with denoising performance at each step Z Liang, Y Yuan, S Gu, B Chen, T Hang, J Li, L Zheng arXiv preprint arXiv:2406.04314 2 (5), 7, 2024 | 17 | 2024 |
Volumediffusion: Flexible text-to-3d generation with efficient volumetric encoder Z Tang, S Gu, C Wang, T Zhang, J Bao, D Chen, B Guo arXiv preprint arXiv:2312.11459, 2023 | 16 | 2023 |
Simplified diffusion schr\" odinger bridge Z Tang, T Hang, S Gu, D Chen, B Guo arXiv preprint arXiv:2403.14623, 2024 | 11 | 2024 |
Improved noise schedule for diffusion training T Hang, S Gu, X Geng, B Guo arXiv preprint arXiv:2407.03297, 2024 | 8 | 2024 |
Priorgan: Real data prior for generative adversarial nets S Gu, J Bao, D Chen, F Wen arXiv preprint arXiv:2006.16990, 2020 | 6 | 2020 |
Fontstudio: shape-adaptive diffusion model for coherent and consistent font effect generation X Mu, L Chen, B Chen, S Gu, J Bao, D Chen, J Li, Y Yuan European Conference on Computer Vision, 305-322, 2024 | 4 | 2024 |
Cca: Collaborative competitive agents for image editing T Hang, S Gu, D Chen, X Geng, B Guo arXiv preprint arXiv:2401.13011, 2024 | 2 | 2024 |
Several questions of visual generation in 2024 S Gu https://cientgu.github.io/files/VisualSignalDecomposition.pdf, 2024 | 1 | 2024 |