Lei He

Cytowane przez

	Wszystkie	Od 2020
Cytowania	4846	3923
h-indeks	33	29
i10-indeks	76	65

1600

800

400

1200

201020112012201320142015201620172018201920202021202220232024202517 25 15 39 41 48 150 144 213 182 272 377 439 929 1511 361

Dostęp publiczny

Wyświetl wszystko

5 artykułów

7 artykułów

dostępne

niedostępne

Objęte finansowaniem

Obserwuj

Lei He

Principal Scientist Manager, Microsoft

Zweryfikowany adres z microsoft.com

artificial intelligence human language processing speech synthesis speech recognition pronunciation assessment.


Tytuł Sortuj wg cytatów Sortuj wg roku Sortuj wg tytułu	Cytowane przez Cytowane przez	Rok
Neural codec language models are zero-shot text to speech synthesizers C Wang, S Chen, Y Wu, Z Zhang, L Zhou, S Liu, Z Chen, Y Liu, H Wang, ... arXiv preprint arXiv:2301.02111, 2023	676	2023
Learning latent representations for style control and transfer in end-to-end speech synthesis YJ Zhang, S Pan, L He, ZH Ling ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	309	2019
The numerical manifold method: a review G Ma, X An, LEI He International Journal of Computational Methods 7 (01), 1-32, 2010	255	2010
Naturalspeech 2: Latent diffusion models are natural and zero-shot speech and singing synthesizers K Shen, Z Ju, X Tan, Y Liu, Y Leng, L He, T Qin, S Zhao, J Bian arXiv preprint arXiv:2304.09116, 2023	246	2023
NaturalSpeech: End-to-End Text-to-Speech Synthesis With Human-Level Quality X Tan, J Chen, H Liu, J Cong, C Zhang, Y Liu, X Wang, Y Leng, Y Yi, L He, ... IEEE Transactions on Pattern Analysis and Machine Intelligence 46 (6), 4234-4245, 2024	241	2024
Speak foreign languages with your own voice: Cross-lingual neural codec language modeling Z Zhang, L Zhou, C Wang, S Chen, Y Wu, S Liu, Z Chen, Y Liu, H Wang, ... arXiv preprint arXiv:2303.03926, 2023	177	2023
Part-of-speech tagging with bidirectional long short-term memory recurrent neural network P Wang, Y Qian, FK Soong, L He, H Zhao arXiv preprint arXiv:1510.06168, 2015	169	2015
Multi-speaker modeling and speaker adaptation for DNN-based TTS synthesis Y Fan, Y Qian, FK Soong, L He 2015 IEEE international conference on acoustics, speech and signal …, 2015	164	2015
Naturalspeech 3: Zero-shot speech synthesis with factorized codec and diffusion models Z Ju, Y Wang, K Shen, X Tan, D Xin, D Yang, Y Liu, Y Leng, K Song, ... arXiv preprint arXiv:2403.03100, 2024	161	2024
A unified tagging solution: Bidirectional lstm recurrent neural network with word embedding P Wang, Y Qian, FK Soong, L He, H Zhao arXiv preprint arXiv:1511.00215, 2015	124	2015
Developing RNN-T models surpassing high-performance hybrid models with customization capability J Li, R Zhao, Z Meng, Y Liu, W Wei, S Parthasarathy, V Mazalov, Z Wang, ... arXiv preprint arXiv:2007.15188, 2020	120	2020
Robust sequence-to-sequence acoustic modeling with stepwise monotonic attention for neural TTS M He, Y Deng, L He arXiv preprint arXiv:1906.00672, 2019	101	2019
Development of three-dimensional numerical manifold method for jointed rock slope stability analysis L He, XM An, GW Ma, ZY Zhao International Journal of Rock Mechanics and Mining Sciences 64, 22-35, 2013	85	2013
Conversational end-to-end tts for voice agents H Guo, S Zhang, FK Soong, L He, L Xie 2021 IEEE Spoken Language Technology Workshop (SLT), 403-409, 2021	80	2021
Adaspeech 4: Adaptive text to speech in zero-shot scenarios Y Wu, X Tan, B Li, L He, S Zhao, R Song, T Qin, TY Liu arXiv preprint arXiv:2204.00436, 2022	75	2022
Word embedding for recurrent neural network based TTS synthesis P Wang, Y Qian, FK Soong, L He, H Zhao 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015	75	2015
Modeling progressive failures in rock slopes with non‐persistent joints using the numerical manifold method X An, Y Ning, G Ma, L He International Journal for Numerical and Analytical Methods in Geomechanics …, 2014	73	2014
Development of 3D numerical manifold method LEI He, G Ma International Journal of Computational Methods 7 (01), 107-129, 2010	72	2010
Delightfultts: The microsoft speech synthesis system for blizzard challenge 2021 Y Liu, Z Xu, G Wang, K Chen, B Li, X Tan, J Li, L He, S Zhao arXiv preprint arXiv:2110.12612, 2021	67	2021
Improving prosody with linguistic and bert derived features in multi-speaker based mandarin chinese neural tts Y Xiao, L He, H Ming, FK Soong ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	63	2020

Nie można teraz wykonać tej operacji. Spróbuj ponownie później.

Prace 1–20

Cytowania rocznie

Powielone cytowania

Scalone cytowania

Dodaj współautorówWspółautorzy

Obserwuj

Cytowane przez