Zhiying Huang

Cytowane przez

	Wszystkie	Od 2020
Cytowania	229	193
h-indeks	8	7
i10-indeks	8	6

20162017201820192020202120222023202420256 5 12 9 20 9 14 42 76 29

Dostęp publiczny

Wyświetl wszystko

0 artykułów

1 artykuł

dostępne

niedostępne

Objęte finansowaniem

Obserwuj

Zhiying Huang

ByteDance

Zweryfikowany adres z bytedance.com

Machine Learning


Tytuł Sortuj wg cytatów Sortuj wg roku Sortuj wg tytułu	Cytowane przez Cytowane przez	Rok
Seed-tts: A family of high-quality versatile speech generation models P Anastassiou, J Chen, J Chen, Y Chen, Z Chen, Z Chen, J Cong, L Deng, ... arXiv preprint arXiv:2406.02430, 2024	60	2024
Prosospeech: Enhancing prosody with quantized vector pre-training in text-to-speech Y Ren, M Lei, Z Huang, S Zhang, Q Chen, Z Yan, Z Zhao ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	45	2022
Speaker adaptation of RNN-BLSTM for speech recognition based on speaker code Z Huang, J Tang, S Xue, L Dai 2016 IEEE International conference on acoustics, speech and signal …, 2016	30	2016
Polyvoice: Language models for speech to speech translation Q Dong, Z Huang, Q Tian, C Xu, T Ko, Y Zhao, S Feng, T Li, K Wang, ... arXiv preprint arXiv:2306.02982, 2023	24	2023
Linear networks based speaker adaptation for speech synthesis Z Huang, H Lu, M Lei, Z Yan 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	20	2018
Speech recognition method and apparatus Z Huang, S Xue, Z Yan US Patent App. 15/686,094, 2018	18	2018
Devicetts: A small-footprint, fast, stable network for on-device text-to-speech Z Huang, H Li, M Lei arXiv preprint arXiv:2010.15311, 2020	17	2020
基于深度学习的语音识别技术现状与展望戴礼荣，张仕良，黄智颖数据采集与处理 32 (2), 221-231, 2017	10	2017
RNN-BLSTM 声学模型的说话人自适应方法研究黄智颖中国科学技术大学, 2017	2	2017
Rapid speaker adaptation based on D-code extracted from BLSTM-RNN in LVCSR S Xue, Z Yan, Z Huang, L Dai 2016 10th International Symposium on Chinese Spoken Language Processing …, 2016	2	2016
PolyVoice: Language Models for Speech to Speech Translation Q qian Dong, Z Huang, Q Tian, C Xu, T Ko, S Feng, T Li, K Wang, ... The Twelfth International Conference on Learning Representations, 2023	1	2023
Audio Tagging with Compact Feedforward Sequential Memory Network and Audio-to-Audio Ratio Based Data Augmentation. Z Huang, S Zhang, M Lei INTERSPEECH, 3377-3381, 2019		2019
Unsupervised speaker adaptation of BLSTM-RNN for LVCSR based on speaker code Z Huang, S Xue, Z Yan, L Dai 2016 10th International Symposium on Chinese Spoken Language Processing …, 2016		2016

Nie można teraz wykonać tej operacji. Spróbuj ponownie później.

Prace 1–13

Cytowania rocznie

Powielone cytowania

Scalone cytowania

Dodaj współautorówWspółautorzy

Obserwuj

Cytowane przez