Zhehuai Chen
Verified email at nvidia.com
Title
Cited by
Year
Google USM: Scaling automatic speech recognition beyond 100 languages
Y Zhang, W Han, J Qin, Y Wang, A Bapna, Z Chen, N Chen, B Li, ...
arXiv preprint arXiv:2303.01037, 2023
Cited by: 308; Year: 2023
Maestro: Matched speech text representations through modality matching
Z Chen, Y Zhang, A Rosenberg, B Ramabhadran, P Moreno, A Bapna, ...
arXiv preprint arXiv:2204.03409, 2022
Cited by: 116; Year: 2022
Progressive joint modeling in unsupervised single-channel overlapped speech recognition
Z Chen, J Droppo, J Li, W Xiong
IEEE/ACM Transactions on Audio, Speech, and Language Processing 26 (1), 184-196, 2017
Cited by: 88; Year: 2017
PaLM 2 technical report. arXiv 2023
R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ...
arXiv preprint arXiv:2305.10403, 2023
Cited by: 87
Knowledge Distillation for Sequence Model.
M Huang, Y You, Z Chen, Y Qian, K Yu
Interspeech, 3703-3707, 2018
Cited by: 72; Year: 2018
Joint Grapheme and Phoneme Embeddings for Contextual End-to-End ASR.
Z Chen, M Jain, Y Wang, ML Seltzer, C Fuegen
Interspeech, 3490-3494, 2019
Cited by: 69; Year: 2019
Improving speech recognition using consistent predictions on synthesized speech
G Wang, A Rosenberg, Z Chen, Y Zhang, B Ramabhadran, Y Wu, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 67; Year: 2020
End-to-end contextual speech recognition using class language models and a token passing decoder
Z Chen, M Jain, Y Wang, ML Seltzer, C Fuegen
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 60; Year: 2019
Improving Speech Recognition Using GAN-Based Speech Synthesis and Contrastive Unspoken Text Selection.
Z Chen, A Rosenberg, Y Zhang, G Wang, B Ramabhadran, PJ Moreno
Interspeech, 556-560, 2020
Cited by: 47; Year: 2020
Phone synchronous speech recognition with CTC lattices
Z Chen, Y Zhuang, Y Qian, K Yu
IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (1), 90-101, 2016
Cited by: 45; Year: 2016
SALM: Speech-augmented language model with in-context learning for speech recognition and translation
Z Chen, H Huang, A Andrusenko, O Hrinchuk, KC Puvvada, J Li, S Ghosh, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 40; Year: 2024
Injecting text in self-supervised speech pretraining
Z Chen, Y Zhang, A Rosenberg, B Ramabhadran, G Wang, P Moreno
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
Cited by: 38; Year: 2021
JOIST: A joint speech and text streaming model for ASR
TN Sainath, R Prabhavalkar, A Bapna, Y Zhang, Z Huo, Z Chen, B Li, ...
2022 IEEE Spoken Language Technology Workshop (SLT), 52-59, 2023
Cited by: 36; Year: 2023
On modular training of neural acoustics-to-word model for LVCSR
Z Chen, Q Liu, H Li, K Yu
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 36; Year: 2018
Tacotron: Towards end-to-end speech synthesis. arXiv 2017
Y Wang, R Skerry-Ryan, D Stanton, Y Wu, RJ Weiss, N Jaitly, Z Yang, ...
arXiv preprint arXiv:1703.10135, 2017
Cited by: 34; Year: 2017
Accented speech recognition: Benchmarking, pre-training, and diverse data
A Aksënova, Z Chen, CC Chiu, D Van Esch, P Golik, W Han, L King, ...
arXiv preprint arXiv:2205.08014, 2022
Cited by: 28; Year: 2022
Tts4pretrain 2.0: Advancing the use of text and speech in ASR pretraining with consistency and contrastive losses
Z Chen, Y Zhang, A Rosenberg, B Ramabhadran, P Moreno, G Wang
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 26; Year: 2022
Accelerating RNN-T training and inference using CTC guidance
Y Wang, Z Chen, C Zheng, Y Zhang, W Han, P Haghani
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 24; Year: 2023
Sequence discriminative training for deep learning based acoustic keyword spotting
Z Chen, Y Qian, K Yu
Speech Communication 102, 100-111, 2018
Cited by: 24; Year: 2018
Phone Synchronous Decoding with CTC Lattice.
Z Chen, W Deng, T Xu, K Yu
Interspeech, 1923-1927, 2016
Cited by: 24; Year: 2016