팔로우
Zhifeng Kong
제목
인용
인용
연도
Diffwave: A versatile diffusion model for audio synthesis
Z Kong, W Ping, J Huang, K Zhao, B Catanzaro
ICLR 2021 (oral), 2021
15282021
On fast sampling of diffusion probabilistic models
Z Kong, W Ping
ICML 2021 Workshop on Invertible Neural Networks, Normalizing Flows, and …, 2021
1932021
A conditional point diffusion-refinement paradigm for 3d point cloud completion
Z Lyu, Z Kong, X Xu, L Pan, D Lin
ICLR 2022, 2022
1422022
Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities
Z Kong, A Goel, R Badlani, W Ping, R Valle, B Catanzaro
ICML 2024, 2024
802024
Speech denoising in the waveform domain with self-attention
Z Kong, W Ping, A Dantrey, B Catanzaro
ICASSP 2022, 7867-7871, 2022
802022
Fastened crown: Tightened neural network robustness certificates
Z Lyu, CY Ko, Z Kong, N Wong, D Lin, L Daniel
AAAI 2020, 2020
742020
The expressive power of a class of normalizing flow models
Z Kong, K Chaudhuri
AISTATS 2020, 2020
612020
Multi-object tracking using online metric learning with long short-term memory
X Wan, J Wang, Z Kong, Q Zhao, S Deng
ICIP 2018, 2018
492018
Understanding instance-based interpretability of variational auto-encoders
Z Kong, K Chaudhuri
NeurIPS 2021, 2021
302021
Data redaction from pre-trained gans
Z Kong, K Chaudhuri
IEEE SaTML 2023, 2023
162023
Can Membership Inferencing be Refuted?
Z Kong, AR Chowdhury, K Chaudhuri
arXiv preprint arXiv:2303.03648, 2023
14*2023
Improving text-to-audio models with synthetic captions
Z Kong, S Lee, D Ghosal, N Majumder, A Mehrish, R Valle, S Poria, ...
arXiv preprint arXiv:2406.15487, 2024
112024
Universal approximation of residual flows in maximum mean discrepancy
Z Kong, K Chaudhuri
arXiv preprint arXiv:2103.05793, 2021
92021
Cleanunet 2: A hybrid speech denoising model on waveform and spectrogram
Z Kong, W Ping, A Dantrey, B Catanzaro
INTERSPEECH 2023, 2023
82023
Data redaction from conditional generative models
Z Kong, K Chaudhuri
IEEE SaTML 2024 (Best paper), 2024
72024
Audio Dialogues: Dialogues dataset for audio and music understanding
A Goel, Z Kong, R Valle, B Catanzaro
arXiv preprint arXiv:2404.07616, 2024
62024
Approximate data deletion in generative models
Z Kong, S Alfeld
ECAI 2023, 2022
52022
Automatic audio captioning with encoder fusion, multi-layer aggregation, and large language model enriched summarization
J Jung, D Zhang, HCH Yang, SL Wu, DM Chan, Z Kong, D Ruifan, ...
DCASE Challenge, Tech. Rep, 2024
22024
Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data
S Ghosh, S Kumar, Z Kong, R Valle, B Catanzaro, D Manocha
arXiv preprint arXiv:2410.02056, 2024
12024
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities
S Ghosh, Z Kong, S Kumar, S Sakshi, J Kim, W Ping, R Valle, D Manocha, ...
arXiv preprint arXiv:2503.03983, 2025
2025
현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.
학술자료 1–20