Obserwuj
Zhiying Huang
Zhiying Huang
ByteDance
Zweryfikowany adres z bytedance.com
Tytuł
Cytowane przez
Cytowane przez
Rok
Seed-tts: A family of high-quality versatile speech generation models
P Anastassiou, J Chen, J Chen, Y Chen, Z Chen, Z Chen, J Cong, L Deng, ...
arXiv preprint arXiv:2406.02430, 2024
602024
Prosospeech: Enhancing prosody with quantized vector pre-training in text-to-speech
Y Ren, M Lei, Z Huang, S Zhang, Q Chen, Z Yan, Z Zhao
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
452022
Speaker adaptation of RNN-BLSTM for speech recognition based on speaker code
Z Huang, J Tang, S Xue, L Dai
2016 IEEE International conference on acoustics, speech and signal …, 2016
302016
Polyvoice: Language models for speech to speech translation
Q Dong, Z Huang, Q Tian, C Xu, T Ko, Y Zhao, S Feng, T Li, K Wang, ...
arXiv preprint arXiv:2306.02982, 2023
242023
Linear networks based speaker adaptation for speech synthesis
Z Huang, H Lu, M Lei, Z Yan
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
202018
Speech recognition method and apparatus
Z Huang, S Xue, Z Yan
US Patent App. 15/686,094, 2018
182018
Devicetts: A small-footprint, fast, stable network for on-device text-to-speech
Z Huang, H Li, M Lei
arXiv preprint arXiv:2010.15311, 2020
172020
基于深度学习的语音识别技术现状与展望
戴礼荣, 张仕良, 黄智颖
数据采集与处理 32 (2), 221-231, 2017
102017
RNN-BLSTM 声学模型的说话人自适应方法研究
黄智颖
中国科学技术大学, 2017
22017
Rapid speaker adaptation based on D-code extracted from BLSTM-RNN in LVCSR
S Xue, Z Yan, Z Huang, L Dai
2016 10th International Symposium on Chinese Spoken Language Processing …, 2016
22016
PolyVoice: Language Models for Speech to Speech Translation
Q qian Dong, Z Huang, Q Tian, C Xu, T Ko, S Feng, T Li, K Wang, ...
The Twelfth International Conference on Learning Representations, 2023
12023
Audio Tagging with Compact Feedforward Sequential Memory Network and Audio-to-Audio Ratio Based Data Augmentation.
Z Huang, S Zhang, M Lei
INTERSPEECH, 3377-3381, 2019
2019
Unsupervised speaker adaptation of BLSTM-RNN for LVCSR based on speaker code
Z Huang, S Xue, Z Yan, L Dai
2016 10th International Symposium on Chinese Spoken Language Processing …, 2016
2016
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–13