Shuming Ma

Zitiert von

	Alle	Seit 2019
Zitate	5588	5402
h-index	40	39
i10-index	68	66

2500

1250

625

1875

2017201820192020202120222023202423 154 219 295 423 587 1330 2474

Öffentlicher Zugriff

Alle anzeigen

17 Artikel

1 Artikel

verfügbar

nicht verfügbar

Basierend auf Fördermandaten

Koautoren

Furu WeiPartner Research Manager, Microsoft ResearchBestätigte E-Mail-Adresse bei microsoft.com
Xu SunAssociate Professor, Peking UniversityBestätigte E-Mail-Adresse bei pku.edu.cn
houfeng wangPeking UniversityBestätigte E-Mail-Adresse bei pku.edu.cn
Junyang LinQwen Team, Alibaba Group & Peking UniversityBestätigte E-Mail-Adresse bei alibaba-inc.com
Lei CuiMicrosoft Research AsiaBestätigte E-Mail-Adresse bei microsoft.com
Tianyu LiuAlibabaBestätigte E-Mail-Adresse bei pku.edu.cn
Jingjing XuShanghai AI LabBestätigte E-Mail-Adresse bei pku.edu.cn
Wenjie LiThe Hong Kong Polytechnic UniversityBestätigte E-Mail-Adresse bei comp.polyu.edu.hk
Sujian LIPeking Univ.Bestätigte E-Mail-Adresse bei pku.edu.cn
Yizhong WangUniversity of WashingtonBestätigte E-Mail-Adresse bei cs.washington.edu

Folgen

Shuming Ma

Microsoft Research Asia

Bestätigte E-Mail-Adresse bei microsoft.com - Startseite

Natural language processing deep learning


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
Kosmos-2: Grounding multimodal large language models to the world Z Peng, W Wang, L Dong, Y Hao, S Huang, S Ma, F Wei arXiv preprint arXiv:2306.14824, 2023	504	2023
SGM: sequence generation model for multi-label classification P Yang, X Sun, W Li, S Ma, W Wu, H Wang arXiv preprint arXiv:1806.04822, 2018	486	2018
Language is not all you need: Aligning perception with language models S Huang, L Dong, W Wang, Y Hao, S Singhal, S Ma, T Lv, L Cui, ... Advances in Neural Information Processing Systems 36, 72096-72109, 2023	425	2023
Why can gpt learn in-context? language models implicitly perform gradient descent as meta-optimizers D Dai, Y Sun, L Dong, Y Hao, S Ma, Z Sui, F Wei arXiv preprint arXiv:2212.10559, 2022	319	2022
Retentive network: A successor to transformer for large language models Y Sun, L Dong, S Huang, S Ma, Y Xia, J Xue, J Wang, F Wei arXiv preprint arXiv:2307.08621, 2023	246	2023
meprop: Sparsified back propagation for accelerated deep learning with reduced overfitting X Sun, X Ren, S Ma, H Wang International Conference on Machine Learning, 3299-3308, 2017	197	2017
Global encoding for abstractive summarization J Lin, X Sun, S Ma, Q Su arXiv preprint arXiv:1805.03989, 2018	194	2018
Deepnet: Scaling transformers to 1,000 layers H Wang, S Ma, L Dong, S Huang, D Zhang, F Wei IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024	154	2024
A length-extrapolatable transformer Y Sun, L Dong, B Patra, S Ma, S Huang, A Benhaim, V Chaudhary, ... arXiv preprint arXiv:2212.10554, 2022	134	2022
XLM-E: Cross-lingual Language Model Pre-training via ELECTRA Z Chi arXiv preprint arXiv:2106.16138, 2021	130	2021
The era of 1-bit llms: All large language models are in 1.58 bits S Ma, H Wang, L Ma, L Wang, W Wang, S Huang, L Dong, R Wang, J Xue, ... arXiv preprint arXiv:2402.17764, 2024	128	2024
Longnet: Scaling transformers to 1,000,000,000 tokens J Ding, S Ma, L Dong, X Zhang, S Huang, W Wang, N Zheng, F Wei arXiv preprint arXiv:2307.02486, 2023	126	2023
Language models are general-purpose interfaces Y Hao, H Song, L Dong, S Huang, Z Chi, W Wang, S Ma, F Wei arXiv preprint arXiv:2206.06336, 2022	104	2022
A simple and effective unified encoder for document-level machine translation S Ma, D Zhang, M Zhou Proceedings of the 58th annual meeting of the association for computational …, 2020	101	2020
Alternating language modeling for cross-lingual pre-training J Yang, S Ma, D Zhang, S Wu, Z Li, M Zhou Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 9386-9393, 2020	89	2020
A whole-slide foundation model for digital pathology from real-world data H Xu, N Usuyama, J Bagga, S Zhang, R Rao, T Naumann, C Wong, ... Nature, 1-8, 2024	85	2024
Improving semantic relevance for sequence-to-sequence learning of chinese social media text summarization S Ma, X Sun, J Xu, H Wang, W Li, Q Su arXiv preprint arXiv:1706.02459, 2017	82	2017
mT6: Multilingual Pretrained Text-to-Text Transformer with Translation Pairs Z Chi arXiv preprint arXiv:2104.08692, 2021	80	2021
Semantic-unit-based dilated convolution for multi-label text classification J Lin, Q Su, P Yang, S Ma, X Sun arXiv preprint arXiv:1808.08561, 2018	79	2018
Query and output: Generating words by querying distributed word representations for paraphrase generation S Ma, X Sun, W Li, S Li, W Li, X Ren arXiv preprint arXiv:1803.01465, 2018	78	2018

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–20

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von

Koautoren