Google's neural machine translation system: Bridging the gap between human and machine translation Y Wu arXiv preprint arXiv:1609.08144, 2016 | 9411 | 2016 |
Natural tts synthesis by conditioning wavenet on mel spectrogram predictions J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, Y Zhang, ... 2018 IEEE international conference on acoustics, speech and signal …, 2018 | 3315 | 2018 |
Conformer: Convolution-augmented transformer for speech recognition A Gulati, J Qin, CC Chiu, N Parmar, Y Zhang, J Yu, W Han, S Wang, ... arXiv preprint arXiv:2005.08100, 2020 | 3306 | 2020 |
Tacotron: Towards end-to-end speech synthesis Y Wang, RJ Skerry-Ryan, D Stanton, Y Wu, RJ Weiss, N Jaitly, Z Yang, ... arXiv preprint arXiv:1703.10135, 2017 | 2503* | 2017 |
Google’s multilingual neural machine translation system: Enabling zero-shot translation M Johnson, M Schuster, QV Le, M Krikun, Y Wu, Z Chen, N Thorat, ... Transactions of the Association for Computational Linguistics 5, 339-351, 2017 | 2410 | 2017 |
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ... arXiv preprint arXiv:2312.11805, 2023 | 2070 | 2023 |
Gpipe: Efficient training of giant neural networks using pipeline parallelism Y Huang, Y Cheng, A Bapna, O Firat, D Chen, M Chen, HJ Lee, J Ngiam, ... Advances in neural information processing systems 32, 2019 | 1760 | 2019 |
State-of-the-art speech recognition with sequence-to-sequence models CC Chiu, TN Sainath, Y Wu, R Prabhavalkar, P Nguyen, Z Chen, ... 2018 IEEE international conference on acoustics, speech and signal …, 2018 | 1474 | 2018 |
Exploring the limits of language modeling R Jozefowicz, O Vinyals, M Schuster, N Shazeer, Y Wu arXiv preprint arXiv:1602.02410, 2016 | 1441 | 2016 |
Palm 2 technical report R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023 | 1400 | 2023 |
Coca: Contrastive captioners are image-text foundation models J Yu, Z Wang, V Vasudevan, L Yeung, M Seyedhosseini, Y Wu arXiv preprint arXiv:2205.01917, 2022 | 1288 | 2022 |
Transfer learning from speaker verification to multispeaker text-to-speech synthesis Y Jia, Y Zhang, R Weiss, Q Wang, J Shen, F Ren, P Nguyen, R Pang, ... Advances in neural information processing systems 31, 2018 | 996 | 2018 |
Scaling autoregressive models for content-rich text-to-image generation J Yu, Y Xu, JY Koh, T Luong, G Baid, Z Wang, V Vasudevan, A Ku, Y Yang, ... arXiv preprint arXiv:2206.10789 2 (3), 5, 2022 | 989 | 2022 |
Libritts: A corpus derived from librispeech for text-to-speech H Zen, V Dang, R Clark, Y Zhang, RJ Weiss, Y Jia, Z Chen, Y Wu arXiv preprint arXiv:1904.02882, 2019 | 945 | 2019 |
Streaming end-to-end speech recognition for mobile devices Y He, TN Sainath, R Prabhavalkar, I McGraw, R Alvarez, D Zhao, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 748 | 2019 |
Development and implementation of high-throughput SNP genotyping in barley TJ Close, PR Bhat, S Lonardi, Y Wu, N Rostoks, L Ramsay, A Druka, ... BMC genomics 10, 1-13, 2009 | 743 | 2009 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... arXiv preprint arXiv:2403.05530, 2024 | 614 | 2024 |
Efficient and accurate construction of genetic linkage maps from the minimum spanning tree of a graph Y Wu, PR Bhat, TJ Close, S Lonardi PLoS genetics 4 (10), e1000212, 2008 | 606 | 2008 |
Glam: Efficient scaling of language models with mixture-of-experts N Du, Y Huang, AM Dai, S Tong, D Lepikhin, Y Xu, M Krikun, Y Zhou, ... International Conference on Machine Learning, 5547-5569, 2022 | 540 | 2022 |
The best of both worlds: Combining recent advances in neural machine translation MX Chen, O Firat, A Bapna, M Johnson, W Macherey, G Foster, L Jones, ... arXiv preprint arXiv:1804.09849, 2018 | 539 | 2018 |