Kaitao Song

Cited by

	All	Since 2019
Citations	9808	9806
h-index	19	19
i10-index	30	30

4100

2050

1025

3075

20192020202120222023202465 262 582 1620 3047 4091

Public access

View all

9 articles

1 article

available

not available

Based on funding mandates

Co-authors

Xu TanPrincipal Researcher and Research Manager, MicrosoftVerified email at microsoft.com
Tao QinPartner Research Manager, Microsoft ResearchVerified email at microsoft.com
Tie-Yan LiuDistinguished Scientist, Microsoft Research AI4Science | IEEE Fellow | ACM Fellow | AAIA FellowVerified email at microsoft.com
Wenhai Wang (王文海)CUHK | Shanghai AI Laboratory | NJUVerified email at cuhk.edu.hk
Xiang Li（李翔）Associate Professor, Nankai UniversityVerified email at nankai.edu.cn
Yongliang ShenZhejiang UniversityVerified email at zju.edu.cn
Renqian LuoSenior Researcher, Microsoft ResearchVerified email at microsoft.com
Jin XuQwen Team, Alibaba GroupVerified email at alibaba-inc.com
Yi Ren (任意)Research Scientist, TiktokVerified email at bytedance.com
Xiu-Shen WeiProfessor, Southeast UniversityVerified email at seu.edu.cn
Xiangbo Shu (舒祥波)Professor, Nanjing University of Science and TechnologyVerified email at njust.edu.cn
Yicheng ZouShanghai AI LaboratoryVerified email at pjlab.org.cn
Hao SunPeking UniversityVerified email at pku.edu.cn
Di HePeking UniversityVerified email at pku.edu.cn
Dongsheng LiMicrosoft Research AsiaVerified email at microsoft.com
Yezhen WangNational University of SingaporeVerified email at comp.nus.edu.sg

Kaitao Song

Senior Researcher, Microsoft Research

Verified email at microsoft.com - Homepage

Natural Language Processing Large Language Models Artificial General Intelligence


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Pyramid vision transformer: A versatile backbone for dense prediction without convolutions W Wang, E Xie, X Li, DP Fan, K Song, D Liang, T Lu, P Luo, L Shao ICCV 2021, 2021	4241	2021
Pvt v2: Improved baselines with pyramid vision transformer W Wang, E Xie, X Li, DP Fan, K Song, D Liang, T Lu, P Luo, L Shao Computational Visual Media 8 (3), 415-424, 2022	1456	2022
Mass: Masked sequence to sequence pre-training for language generation K Song, X Tan, T Qin, J Lu, TY Liu ICML 2019, 2019	1173	2019
Mpnet: Masked and permuted pre-training for language understanding K Song, X Tan, T Qin, J Lu, TY Liu NeurIPS 2020, 2020	1105	2020
HuggingGPT: Solving AI tasks with ChatGPT and Its Friends in Huggingface Y Shen, K Song, X Tan, D Li, W Lu, Y Zhuang NeurIPS 2023, 2023	941	2023
Connecting large language models with evolutionary algorithms yields powerful prompt optimizers Q Guo, R Wang, J Guo, B Li, K Song, X Tan, G Liu, J Bian, Y Yang ICLR 2024, 2023	136	2023
NaturalSpeech 3: Zero-shot speech synthesis with factorized codec and diffusion models Z Ju, Y Wang, K Shen, X Tan, D Xin, D Yang, Y Liu, Y Leng, K Song, ... ICML 2024, 2024	83	2024
NAS-BERT: Task-Agnostic and Adaptive-Size BERT Compression with Neural Architecture Search J Xu, X Tan, R Luo, K Song, J Li, T Qin, TY Liu KDD 2021, 2021	80	2021
SongMASS: Automatic Song Writing with Pre-training and Alignment Constraint Z Sheng, K Song, X Tan, Y Ren, W Ye, S Zhang, T Qin AAAI 2021, 2020	67	2020
Bi-modal progressive mask attention for fine-grained recognition K Song, XS Wei, X Shu, RJ Song, J Lu IEEE Transactions on Image Processing 29, 7006-7018, 2020	64	2020
DiffusionNER: Boundary Diffusion for Named Entity Recognition Y Shen, K Song, X Tan, D Li, W Lu, Y Zhuang ACL 2023, 2023	45	2023
DeepRapper: Neural Rap Generation with Rhyme and Rhythm Modeling L Xue, K Song, D Wu, X Tan, NL Zhang, T Qin, WQ Zhang, TY Liu ACL 2021, 2021	38	2021
Generating adversarial examples with conditional generative adversarial net P Yu, K Song, J Lu 2018 24th international conference on pattern recognition (ICPR), 676-681, 2018	36	2018
Analyzing and Mitigating Interference in Neural Architecture Search J Xu, X Tan, K Song, R Luo, Y Leng, T Qin, TY Liu, J Li ICML 2022, 2021	32	2021
Prompttts 2: Describing and generating voices with text prompt Y Leng, Z Guo, K Shen, X Tan, Z Ju, Y Liu, Y Liu, D Yang, L Zhang, ... ICLR 2024, 2023	30	2023
Easytool: Enhancing llm-based agents with concise tool instruction S Yuan, K Song, J Chen, X Tan, Y Shen, R Kan, D Li, D Yang ICLR 2024 Workshop on LLM Agents, 2024	26	2024
Taskbench: Benchmarking large language models for task automation Y Shen, K Song, X Tan, W Zhang, K Ren, S Yuan, W Lu, D Li, Y Zhuang NeurIPS 2024, 2023	23	2023
Mixed-phoneme bert: Improving bert with mixed phoneme and sup-phoneme representations for text to speech G Zhang, K Song, X Tan, D Tan, Y Yan, Y Liu, G Wang, W Zhou, T Qin, ... INTERSPEECH 2022, 2022	23	2022
SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition Y Leng, X Tan, W Liu, K Song, R Wang, XY Li, T Qin, E Lin, TY Liu AAAI 2023, 2022	20	2022
LightPAFF: A two-stage distillation framework for pre-training and fine-tuning K Song, H Sun, X Tan, T Qin, J Lu, H Liu, TY Liu arXiv preprint arXiv:2004.12817, 2020	19	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors