Kainan Peng

Cited by

	All	Since 2020
Citations	3030	2479
h-index	11	11
i10-index	12	12

Citations per year

Year	Citations
2017	15
2018	138
2019	369
2020	432
2021	537
2022	556
2023	500
2024	388
2025	61

Co-authors

Wei Ping: Distinguished Research Scientist, NVIDIA (verified email at nvidia.com)
Sercan O. Arik: Google (verified email at google.com)
Yanqi Zhou: Google (verified email at google.com)
Gregory Diamos: Lamini (verified email at landing.ai)
Sharan Narang: Research Engineer, Meta AI (verified email at meta.com)
Ajay Kannan: Google (verified email at google.com)
Jitong Chen: ByteDance (verified email at cse.ohio-state.edu)

Kainan Peng

Amazon

Verified email at alumni.cmu.edu

Text-to-Speech, Computer Engineering, Machine Learning


Title	Authors	Venue	Cited by	Year
Deep Voice 3: Scaling text-to-speech with convolutional sequence learning	W Ping, K Peng, A Gibiansky, SO Arik, A Kannan, S Narang, J Raiman, ...	ICLR 2018	921*	2018
Deep voice 2: Multi-speaker neural text-to-speech	A Gibiansky, S Arik, G Diamos, J Miller, K Peng, W Ping, J Raiman, ...	NIPS 2017, 2962-2970	659*	2017
Neural voice cloning with a few samples	S Arik, J Chen, K Peng, W Ping, Y Zhou	NeurIPS 2018, 10019-10029	484	2018
ClariNet: Parallel wave generation in end-to-end text-to-speech	W Ping, K Peng, J Chen	ICLR 2019	438	2018
Non-Autoregressive Neural Text-to-Speech	K Peng, W Ping, Z Song, K Zhao	ICML 2020	179*	2019
WaveFlow: A Compact Flow-based Model for Raw Audio	W Ping, K Peng, K Zhao, Z Song	ICML 2020	156	2019
Systems and methods for neural voice cloning with a few samples	C Jitong, P Kainan, P Wei, Z Yanqi	US Patent 11,238,843	57	2022
Incremental text-to-speech synthesis with prefix-to-prefix framework	M Ma, B Zheng, K Liu, R Zheng, H Liu, K Peng, K Church, L Huang	arXiv preprint arXiv:1911.02750	36	2019
Systems and methods for parallel wave generation in end-to-end text-to-speech	P Wei, P Kainan, C Jitong	US Patent 10,872,596	24	2020
Multi-speaker end-to-end speech synthesis	J Park, K Zhao, K Peng, W Ping	arXiv preprint arXiv:1907.04462	20	2019
Parallel neural text-to-speech	P Kainan, P Wei, S Zhao, Z Kexin	US Patent 11,017,761	15	2021
Systems and methods for neural text-to-speech using convolutional sequence learning	P Wei, P Kainan, S NARANG, A KANNAN, A GIBIANSKY, J RAIMAN, ...	US Patent 10,796,686	11	2020
Systems and methods for multi-speaker neural text-to-speech	G DIAMOS, A GIBIANSKY, J Miller, P Kainan, P Wei, J RAIMAN, Z Yanqi	US Patent 10,896,669	7	2021
Neural voice cloning with a few samples	OA Sercan, C Jitong, P Kainan, P Wei, Y Zhou	Proc. 32nd Int. Conf. Neural Inf. Process. Syst., 10040-10050	5	2018
VoiceShop: A Unified Speech-to-Speech Framework for Identity-Preserving Zero-Shot Voice Editing	P Anastassiou, Z Tang, K Peng, D Jia, J Li, M Tu, Y Wang, Y Wang, M Ma	arXiv preprint arXiv:2404.06674	4	2024
Deep Voice 3: scaling text-to-speech with convolutional sequence learning	P Wei, P Kainan, G Andrew, SO Arik, A Kannan, S Narang, J Raiman, ...	arXiv preprint	4	2017
Zero-shot accent conversion using pseudo siamese disentanglement network	D Jia, Q Tian, K Peng, J Li, Y Chen, M Ma, Y Wang, Y Wang	arXiv preprint arXiv:2212.05751	3	2022
Waveform generation using end-to-end text-to-waveform system	P Wei, P Kainan, C Jitong	US Patent 11,482,207	3	2022
Vevo: Controllable zero-shot voice imitation with self-supervised disentanglement	X Zhang, X Zhang, K Peng, Z Tang, V Manohar, Y Liu, J Hwang, D Li, ...	arXiv preprint arXiv:2502.07243	2	2025
Multi-speaker neural text-to-speech	G DIAMOS, A GIBIANSKY, J Miller, P Kainan, P Wei, J RAIMAN, Z Yanqi	US Patent 11,651,763	1	2023
