Kaisheng Yao

Cited by

	All	Since 2019
Citations	10719	7197
h-index	33	29
i10-index	66	42

3100

1550

775

2325

201220132014201520162017201820192020202120222023202442 77 172 359 716 738 1108 1013 917 827 715 679 3005

Public access

View all

4 articles

2 articles

available

not available

Based on funding mandates

Co-authors

Dong Yu (俞栋)Distinguished Scientist @ Tencent AI Lab, ACM/IEEE/ISCA FellowVerified email at global.tencent.com
George ZweigVerified email at mit.edu
Yifan GongPrincipal Science Manager, Microsoft Corp.Verified email at microsoft.com
Frank SeideMicrosoft`Verified email at microsoft.com
Baolin PengMicrosoft Research, RedmondVerified email at microsoft.com
Yangyang ShiMetaVerified email at fb.com
Xiaodong HeAI Lab, JD.com; IEEE/CAAI FellowVerified email at ieee.org
Yu ZhangOpenAIVerified email at csail.mit.edu
Satoshi NakamuraNara Institute of Science and TechnologyVerified email at is.naist.jp
Mike SeltzerFacebookVerified email at fb.com
Hang SuTech Lead Manager at Amazon Inc.Verified email at amazon.com
Kuldip PaliwalProfessor (Chair, Communication and Information Engineering), Griffith University, BrisbaneVerified email at griffith.edu.au
Xiaolong LiAlibaba Group (previously)Verified email at alibaba-inc.com
Jinyu LiPartner Applied Science Manager, MicrosoftVerified email at microsoft.com
Grégoire MesnilPhD Student at Université de MontréalVerified email at umontreal.ca
Huaming WangPartner Group Engineering Manager, MicrosoftVerified email at microsoft.com
Chris DyerDeepMind, Carnegie MellonVerified email at google.com
Kat VylomovaUniversity of MelbourneVerified email at unimelb.edu.au
Trevor CohnGoogle Research (Research Scientist) & University of Melbourne (Professor; School of CIS)Verified email at unimelb.edu.au
Te-Won LeeVerified email at ucsd.edu

Kaisheng Yao

Google

Verified email at google.com - Homepage

foundation models natural language processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ... arXiv preprint arXiv:2312.11805, 2023	2070	2023
Recent advances in deep learning for speech research at Microsoft L Deng, J Li, JT Huang, K Yao, D Yu, F Seide, M Seltzer, G Zweig, X He, ... 2013 IEEE international conference on acoustics, speech and signal …, 2013	1063	2013
Using recurrent neural networks for slot filling in spoken language understanding G Mesnil, Y Dauphin, K Yao, Y Bengio, L Deng, D Hakkani-Tur, X He, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing 23 (3), 530-539, 2014	774	2014
CNTK: Microsoft's open-source deep-learning toolkit F Seide, A Agarwal Proceedings of the 22nd ACM SIGKDD international conference on knowledge …, 2016	651	2016
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... arXiv preprint arXiv:2403.05530, 2024	614	2024
KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition D Yu, K Yao, H Su, G Li, F Seide 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013	525	2013
An introduction to computational networks and the computational network toolkit D Yu, A Eversole, M Seltzer, K Yao, Z Huang, B Guenter, O Kuchaiev, ... Microsoft Technical Report MSR-TR-2014–112, 2014	475	2014
Recurrent neural networks for language understanding. K Yao, G Zweig, MY Hwang, Y Shi, D Yu Interspeech, 2524-2528, 2013	421	2013
Spoken language understanding using long short-term memory neural networks K Yao, B Peng, Y Zhang, D Yu, G Zweig, Y Shi 2014 IEEE spoken language technology workshop (SLT), 189-194, 2014	409	2014
Highway long short-term memory rnns for distant speech recognition Y Zhang, G Chen, D Yu, K Yao, S Khudanpur, J Glass 2016 IEEE international conference on acoustics, speech and signal …, 2016	367	2016
Assignment of semantic labels to a sequence of words using neural network architectures A Deoras, K Yao, X He, L Deng, GG Zweig, R Sarikaya, D Yu, MY Hwang, ... US Patent 10,867,597, 2020	298	2020
Adaptation of context-dependent deep neural networks for automatic speech recognition K Yao, D Yu, F Seide, H Su, L Deng, Y Gong 2012 IEEE Spoken Language Technology Workshop (SLT), 366-369, 2012	257	2012
Sequence-to-sequence neural net models for grapheme-to-phoneme conversion K Yao, G Zweig arXiv preprint arXiv:1506.00196, 2015	204	2015
Incorporating structural alignment biases into an attentional neural translation model T Cohn, CDV Hoang, E Vymolova, K Yao, C Dyer, G Haffari arXiv preprint arXiv:1601.01085, 2016	200	2016
System and method for text-to-phoneme mapping with prior knowledge K Yao US Patent App. 11/278,497, 2007	171	2007
Recurrent conditional random field for language understanding K Yao, B Peng, G Zweig, D Yu, X Li, F Gao 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014	166	2014
Hyper-structure recurrent neural networks for text-to-speech P Zhao, M Leung, K Yao, B Yan, S Zhao, FA Alleva US Patent 10,127,901, 2018	151	2018
Attention with intention for a neural network conversation model K Yao, G Zweig, B Peng arXiv preprint arXiv:1510.08565, 2015	145	2015
Depth-gated LSTM K Yao, T Cohn, K Vylomova, K Duh, C Dyer arXiv preprint arXiv:1508.03790, 2015	129	2015
Depth-gated recurrent neural networks K Yao, T Cohn, K Vylomova, K Duh, C Dyer arXiv preprint arXiv:1508.03790 9, 98, 2015	114	2015

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors