Yu Zhang

Cited by

	All	Since 2020
Citations	28930	25704
h-index	61	60
i10-index	127	115

Year	Citations
2015	99
2016	263
2017	419
2018	865
2019	1435
2020	2333
2021	4029
2022	5192
2023	6568
2024	6475
2025	1062

Public access

7 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Chung-Cheng Chiu (邱中鎮)	Apple	Verified email at apple.com
Wei Han	OpenAI	Verified email at illinois.edu
Ye Jia	OpenAI	Verified email at google.com
Ron J Weiss	Google	Verified email at google.com
Ruoming Pang (庞若鸣)	Apple AI/ML	Verified email at apple.com
William Chan	Ideogram	Verified email at ideogram.ai
Heiga Zen	Principal Scientist (Director), Google DeepMind	Verified email at google.com
James Glass	MIT Computer Science and Artificial Intelligence Laboratory	Verified email at mit.edu
Bo Li	Google	Verified email at google.com
Jonathan Shen	Google	Verified email at google.com
Dong Yu (俞栋)	Distinguished Scientist @ Tencent AI Lab, ACM/IEEE/ISCA Fellow	Verified email at global.tencent.com
Quoc V. Le	Research Scientist, Google	Verified email at stanford.edu
Daniel S. Park	Google DeepMind	Verified email at google.com
Tara Sainath	Distinguished Research Scientist, Google Deep Mind	Verified email at google.com
Jiahui Yu	Research Scientist, OpenAI	Verified email at openai.com
Zhifeng Chen	Google Inc.	Verified email at google.com
Wei-Ning Hsu	Facebook AI Research (FAIR)	Verified email at csail.mit.edu
Anmol Gulati	Researcher, Google Deepmind	Verified email at google.com
Yuxuan Wang	ByteDance	Verified email at cse.ohio-state.edu
RJ Skerry-Ryan	Google Deepmind	Verified email at alum.mit.edu

Yu Zhang

OpenAI

Verified email at csail.mit.edu

Speech Recognition, Speech Synthesis


Title	Authors	Venue	Cited by	Year
SpecAugment: A simple data augmentation method for automatic speech recognition	DS Park, W Chan, Y Zhang, CC Chiu, B Zoph, ED Cubuk, QV Le	arXiv preprint arXiv:1904.08779, 2019	4451	2019
Conformer: Convolution-augmented transformer for speech recognition	A Gulati, J Qin, CC Chiu, N Parmar, Y Zhang, J Yu, W Han, S Wang, ...	arXiv preprint arXiv:2005.08100, 2020	3654	2020
Natural TTS synthesis by conditioning WaveNet on mel spectrogram predictions	J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, Y Zhang, ...	2018 IEEE international conference on acoustics, speech and signal …, 2018	3464	2018
LibriTTS: A corpus derived from LibriSpeech for text-to-speech	H Zen, V Dang, R Clark, Y Zhang, RJ Weiss, Y Jia, Z Chen, Y Wu	arXiv preprint arXiv:1904.02882, 2019	1081	2019
Transfer learning from speaker verification to multispeaker text-to-speech synthesis	Y Jia, Y Zhang, R Weiss, Q Wang, J Shen, F Ren, P Nguyen, R Pang, ...	Advances in neural information processing systems 31, 2018	1047	2018
Style tokens: Unsupervised style modeling, control and transfer in end-to-end speech synthesis	Y Wang, D Stanton, Y Zhang, RJS Ryan, E Battenberg, J Shor, Y Xiao, ...	International conference on machine learning, 5180-5189, 2018	1032	2018
WaveGrad: Estimating gradients for waveform generation	N Chen, Y Zhang, H Zen, RJ Weiss, M Norouzi, W Chan	arXiv preprint arXiv:2009.00713, 2020	857	2020
Very deep convolutional networks for end-to-end speech recognition	Y Zhang, W Chan, N Jaitly	2017 IEEE international conference on acoustics, speech and signal …, 2017	570	2017
W2v-BERT: Combining contrastive learning and masked language modeling for self-supervised speech pre-training	YA Chung, Y Zhang, W Han, CC Chiu, J Qin, R Pang, Y Wu	2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021	480	2021
An introduction to computational networks and the computational network toolkit	MS Dong Yu, Adam Eversole, Mike Seltzer, Kaisheng Yao, Zhiheng Huang, Brian ...	Tech. Rep. MSR, Microsoft Research, 2014, http://codebox/cntk, 2014	473*	2014
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech	X Wang, J Yamagishi, M Todisco, H Delgado, A Nautsch, N Evans, ...	Computer Speech & Language 64, 101114, 2020	451	2020
Unsupervised learning of disentangled and interpretable representations from sequential data	WN Hsu, Y Zhang, J Glass	Advances in neural information processing systems 30, 2017	437	2017
Spoken language understanding using long short-term memory neural networks	K Yao, B Peng, Y Zhang, D Yu, G Zweig, Y Shi	IEEE SLT, 2014	419	2014
Pushing the limits of semi-supervised learning for automatic speech recognition	Y Zhang, J Qin, DS Park, W Han, CC Chiu, R Pang, QV Le, Y Wu	arXiv preprint arXiv:2010.10504, 2020	392	2020
Advances in joint CTC-attention based end-to-end speech recognition with a deep CNN encoder and RNN-LM	T Hori, S Watanabe, Y Zhang, W Chan	arXiv preprint arXiv:1706.02737, 2017	370	2017
Highway long short-term memory RNNs for distant speech recognition	Y Zhang, G Chen, D Yu, K Yao, S Khudanpur, J Glass	2016 IEEE international conference on acoustics, speech and signal …, 2016	370	2016
Simple recurrent units for highly parallelizable recurrence	T Lei, Y Zhang, SI Wang, H Dai, Y Artzi	arXiv preprint arXiv:1709.02755, 2017	353	2017
ContextNet: Improving convolutional neural networks for automatic speech recognition with global context	W Han, Z Zhang, Y Zhang, J Yu, CC Chiu, J Qin, A Gulati, R Pang, Y Wu	arXiv preprint arXiv:2005.03191, 2020	347	2020
Google USM: Scaling automatic speech recognition beyond 100 languages	Y Zhang, W Han, J Qin, Y Wang, A Bapna, Z Chen, N Chen, B Li, ...	arXiv preprint arXiv:2303.01037, 2023	308	2023
FLEURS: Few-shot learning evaluation of universal representations of speech	A Conneau, M Ma, S Khanuja, Y Zhang, V Axelrod, S Dalmia, J Riesa, ...	2022 IEEE Spoken Language Technology Workshop (SLT), 798-805, 2023	304	2023

Articles 1–20
