Obserwuj
Lei He
Tytuł
Cytowane przez
Cytowane przez
Rok
Neural codec language models are zero-shot text to speech synthesizers
C Wang, S Chen, Y Wu, Z Zhang, L Zhou, S Liu, Z Chen, Y Liu, H Wang, ...
arXiv preprint arXiv:2301.02111, 2023
6762023
Learning latent representations for style control and transfer in end-to-end speech synthesis
YJ Zhang, S Pan, L He, ZH Ling
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
3092019
The numerical manifold method: a review
G Ma, X An, LEI He
International Journal of Computational Methods 7 (01), 1-32, 2010
2552010
Naturalspeech 2: Latent diffusion models are natural and zero-shot speech and singing synthesizers
K Shen, Z Ju, X Tan, Y Liu, Y Leng, L He, T Qin, S Zhao, J Bian
arXiv preprint arXiv:2304.09116, 2023
2462023
NaturalSpeech: End-to-End Text-to-Speech Synthesis With Human-Level Quality
X Tan, J Chen, H Liu, J Cong, C Zhang, Y Liu, X Wang, Y Leng, Y Yi, L He, ...
IEEE Transactions on Pattern Analysis and Machine Intelligence 46 (6), 4234-4245, 2024
2412024
Speak foreign languages with your own voice: Cross-lingual neural codec language modeling
Z Zhang, L Zhou, C Wang, S Chen, Y Wu, S Liu, Z Chen, Y Liu, H Wang, ...
arXiv preprint arXiv:2303.03926, 2023
1772023
Part-of-speech tagging with bidirectional long short-term memory recurrent neural network
P Wang, Y Qian, FK Soong, L He, H Zhao
arXiv preprint arXiv:1510.06168, 2015
1692015
Multi-speaker modeling and speaker adaptation for DNN-based TTS synthesis
Y Fan, Y Qian, FK Soong, L He
2015 IEEE international conference on acoustics, speech and signal …, 2015
1642015
Naturalspeech 3: Zero-shot speech synthesis with factorized codec and diffusion models
Z Ju, Y Wang, K Shen, X Tan, D Xin, D Yang, Y Liu, Y Leng, K Song, ...
arXiv preprint arXiv:2403.03100, 2024
1612024
A unified tagging solution: Bidirectional lstm recurrent neural network with word embedding
P Wang, Y Qian, FK Soong, L He, H Zhao
arXiv preprint arXiv:1511.00215, 2015
1242015
Developing RNN-T models surpassing high-performance hybrid models with customization capability
J Li, R Zhao, Z Meng, Y Liu, W Wei, S Parthasarathy, V Mazalov, Z Wang, ...
arXiv preprint arXiv:2007.15188, 2020
1202020
Robust sequence-to-sequence acoustic modeling with stepwise monotonic attention for neural TTS
M He, Y Deng, L He
arXiv preprint arXiv:1906.00672, 2019
1012019
Development of three-dimensional numerical manifold method for jointed rock slope stability analysis
L He, XM An, GW Ma, ZY Zhao
International Journal of Rock Mechanics and Mining Sciences 64, 22-35, 2013
852013
Conversational end-to-end tts for voice agents
H Guo, S Zhang, FK Soong, L He, L Xie
2021 IEEE Spoken Language Technology Workshop (SLT), 403-409, 2021
802021
Adaspeech 4: Adaptive text to speech in zero-shot scenarios
Y Wu, X Tan, B Li, L He, S Zhao, R Song, T Qin, TY Liu
arXiv preprint arXiv:2204.00436, 2022
752022
Word embedding for recurrent neural network based TTS synthesis
P Wang, Y Qian, FK Soong, L He, H Zhao
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
752015
Modeling progressive failures in rock slopes with non‐persistent joints using the numerical manifold method
X An, Y Ning, G Ma, L He
International Journal for Numerical and Analytical Methods in Geomechanics …, 2014
732014
Development of 3D numerical manifold method
LEI He, G Ma
International Journal of Computational Methods 7 (01), 107-129, 2010
722010
Delightfultts: The microsoft speech synthesis system for blizzard challenge 2021
Y Liu, Z Xu, G Wang, K Chen, B Li, X Tan, J Li, L He, S Zhao
arXiv preprint arXiv:2110.12612, 2021
672021
Improving prosody with linguistic and bert derived features in multi-speaker based mandarin chinese neural tts
Y Xiao, L He, H Ming, FK Soong
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
632020
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–20