Zhifeng Kong

인용

	전체	2020년 이후
서지정보	2316	2310
h-index	11	11
i10-index	12	12

1100

550

275

825

20202021202220232024202515 110 252 707 1015 205

공개 액세스

모두 보기

자료 7개

자료 0개

공개

비공개

재정 지원 요구사항 기준

공동 저자

Bryan CatanzaroNVIDIAacm.org의 이메일 확인됨
Wei PingDistinguished Research Scientist, NVIDIAnvidia.com의 이메일 확인됨
Rafael ValleNVIDIA, UC Berkeley, CNMATnvidia.com의 이메일 확인됨
Kexin ZhaoSnapsnapchat.com의 이메일 확인됨
Kamalika ChaudhuriUCSD, Metaucsd.edu의 이메일 확인됨
Jiaji HuangDuke University, Baidu Research, Amazon AWSamazon.com의 이메일 확인됨
Arushi GoelResearch Scientist, NVIDIAsms.ed.ac.uk의 이메일 확인됨
Dahua LinThe Chinese University of Hong Kongie.cuhk.edu.hk의 이메일 확인됨
Zhaoyang LyuPhD of Information Engineering, The Chinese University of Hong Konglink.cuhk.edu.hk의 이메일 확인됨
Rohan BadlaniComputer Science, Stanford University, BITS Pilanics.stanford.edu의 이메일 확인됨
Xudong XUResearcher, Shanghai AI Laboratorypjlab.org.cn의 이메일 확인됨
Liang PanShanghai AI Labpjlab.org.cn의 이메일 확인됨
Ching-Yun KoMassachusetts Institute of Technologymit.edu의 이메일 확인됨
Sang-gil LeeNVIDIA, Seoul National Universitynvidia.com의 이메일 확인됨
Jinjun WangXian Jiaotong University, Chinaieee.org의 이메일 확인됨
Amrita Roy ChowdhuryUniversity of Michigan, Ann Arborumich.edu의 이메일 확인됨
Ambuj MehrishResearch Fellow, Singapore University of Technology and Design, Singaporesutd.edu.sg의 이메일 확인됨
Soujanya PoriaAssistant Professor, Singapore University of Technology and Designsutd.edu.sg의 이메일 확인됨
Navonil MajumderSingapore University of Technology and Designsutd.edu.sg의 이메일 확인됨
Deepanway GhosalDeepMindgoogle.com의 이메일 확인됨

팔로우

Zhifeng Kong

NVIDIA

ucsd.edu의 이메일 확인됨 - 홈페이지

Deep Generative Models Diffusion Models Audio Foundation Models Audio LM Trustworthy ML


제목 서지정보순 정렬 연도순 정렬 제목순 정렬	인용 인용	연도
Diffwave: A versatile diffusion model for audio synthesis Z Kong, W Ping, J Huang, K Zhao, B Catanzaro ICLR 2021 (oral), 2021	1528	2021
On fast sampling of diffusion probabilistic models Z Kong, W Ping ICML 2021 Workshop on Invertible Neural Networks, Normalizing Flows, and …, 2021	193	2021
A conditional point diffusion-refinement paradigm for 3d point cloud completion Z Lyu, Z Kong, X Xu, L Pan, D Lin ICLR 2022, 2022	142	2022
Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities Z Kong, A Goel, R Badlani, W Ping, R Valle, B Catanzaro ICML 2024, 2024	80	2024
Speech denoising in the waveform domain with self-attention Z Kong, W Ping, A Dantrey, B Catanzaro ICASSP 2022, 7867-7871, 2022	80	2022
Fastened crown: Tightened neural network robustness certificates Z Lyu, CY Ko, Z Kong, N Wong, D Lin, L Daniel AAAI 2020, 2020	74	2020
The expressive power of a class of normalizing flow models Z Kong, K Chaudhuri AISTATS 2020, 2020	61	2020
Multi-object tracking using online metric learning with long short-term memory X Wan, J Wang, Z Kong, Q Zhao, S Deng ICIP 2018, 2018	49	2018
Understanding instance-based interpretability of variational auto-encoders Z Kong, K Chaudhuri NeurIPS 2021, 2021	30	2021
Data redaction from pre-trained gans Z Kong, K Chaudhuri IEEE SaTML 2023, 2023	16	2023
Can Membership Inferencing be Refuted? Z Kong, AR Chowdhury, K Chaudhuri arXiv preprint arXiv:2303.03648, 2023	14*	2023
Improving text-to-audio models with synthetic captions Z Kong, S Lee, D Ghosal, N Majumder, A Mehrish, R Valle, S Poria, ... arXiv preprint arXiv:2406.15487, 2024	11	2024
Universal approximation of residual flows in maximum mean discrepancy Z Kong, K Chaudhuri arXiv preprint arXiv:2103.05793, 2021	9	2021
Cleanunet 2: A hybrid speech denoising model on waveform and spectrogram Z Kong, W Ping, A Dantrey, B Catanzaro INTERSPEECH 2023, 2023	8	2023
Data redaction from conditional generative models Z Kong, K Chaudhuri IEEE SaTML 2024 (Best paper), 2024	7	2024
Audio Dialogues: Dialogues dataset for audio and music understanding A Goel, Z Kong, R Valle, B Catanzaro arXiv preprint arXiv:2404.07616, 2024	6	2024
Approximate data deletion in generative models Z Kong, S Alfeld ECAI 2023, 2022	5	2022
Automatic audio captioning with encoder fusion, multi-layer aggregation, and large language model enriched summarization J Jung, D Zhang, HCH Yang, SL Wu, DM Chan, Z Kong, D Ruifan, ... DCASE Challenge, Tech. Rep, 2024	2	2024
Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data S Ghosh, S Kumar, Z Kong, R Valle, B Catanzaro, D Manocha arXiv preprint arXiv:2410.02056, 2024	1	2024
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities S Ghosh, Z Kong, S Kumar, S Sakshi, J Kim, W Ping, R Valle, D Manocha, ... arXiv preprint arXiv:2503.03983, 2025		2025

현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.

학술자료 1–20

연간 인용횟수

중복된 서지정보

병합된 서지정보

공동 저자 추가공동 저자

팔로우

인용

공동 저자