Samson Tan

Cited by

	All	Since 2019
Citations	2459	2459
h-index	14	14
i10-index	17	17

Citations per year: 9 (2020), 94 (2021), 208 (2022), 1047 (2023), 1084 (2024)

Public access (based on funding mandates): 2 articles available, 0 articles not available

Co-authors

Shafiq Joty - Research Director at Salesforce Research, Assoc. Prof. at NTU (on leave) - Verified email at ntu.edu.sg
Min-Yen Kan (靳民彦) - Associate Professor, National University of Singapore - Verified email at comp.nus.edu.sg
Caiming Xiong - Salesforce Research - Verified email at salesforce.com
Nazneen Rajani - Hugging Face - Verified email at huggingface.co
Chien-Sheng (Jason) Wu - Salesforce AI Research - Verified email at salesforce.com
Karan Goel - Cartesia - Verified email at stanford.edu
Richard Socher - you.com - Verified email at stanford.edu
Lav R. Varshney - University of Illinois Urbana-Champaign - Verified email at illinois.edu

Samson Tan

Applied Scientist at Amazon AGI

Verified email at amazon.com

Natural Language Processing, Ethical AI, Adversarial Robustness, Linguistic Variation


Title	Cited by	Year
Bloom: A 176b-parameter open-access multilingual language model. T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...	1622	2023
Robustness gym: Unifying the NLP evaluation landscape. K Goel, N Rajani, J Vig, S Tan, J Wu, S Zheng, C Xiong, M Bansal, C Ré. arXiv preprint arXiv:2101.04840, 2021	142	2021
It's Morphin' Time! Combating Linguistic Discrimination with Inflectional Perturbations. S Tan, S Joty, MY Kan, R Socher. The 58th Annual Meeting of the Association for Computational Linguistics …, 2020	113	2020
You reap what you sow: On the challenges of bias evaluation under multilingual settings. Z Talat, A Névéol, S Biderman, M Clinciu, M Dey, S Longpre, S Luccioni, ... Proceedings of BigScience Episode# 5--Workshop on Challenges & Perspectives …, 2022	104	2022
Between words and characters: A brief history of open-vocabulary modeling and tokenization in NLP. SJ Mielke, Z Alyafeai, E Salesky, C Raffel, M Dey, M Gallé, A Raja, C Si, ... arXiv preprint arXiv:2112.10508, 2021	102*	2021
Nl-augmenter: A framework for task-sensitive natural language augmentation. KD Dhole, V Gangal, S Gehrmann, A Gupta, Z Li, S Mahamood, ... arXiv preprint arXiv:2112.02721, 2021	72	2021
Data governance in the age of large-scale data-driven language technology. Y Jernite, H Nguyen, S Biderman, A Rogers, M Masoud, V Danchev, ... Proceedings of the 2022 ACM Conference on Fairness, Accountability, and …, 2022	68	2022
Mind Your Inflections! Improving NLP for Non-Standard Englishes with Base-Inflection Encoding. S Tan, S Joty, LR Varshney, MY Kan. The 2020 Conference on Empirical Methods in Natural Language Processing, 2020	43	2020
Reliability Testing for Natural Language Processing Systems. S Tan, S Joty, K Baxter, A Taeihagh, GA Bennett, MY Kan. The Joint Conference of the 59th Annual Meeting of the Association for …, 2021	37	2021
Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots. S Tan, S Joty. 2021 Annual Conference of the North American Chapter of the Association for …, 2021	30	2021
Recode: Robustness evaluation of code generation models. S Wang, Z Li, H Qian, C Yang, Z Wang, M Shang, V Kumar, S Tan, B Ray, ... arXiv preprint arXiv:2212.10264, 2022	24	2022
Recode: Robustness evaluation of code generation models. S Wang, Z Li, H Qian, C Yang, Z Wang, M Shang, V Kumar, S Tan, B Ray, P Bhatia, R Nallapati, MK Ramanathan, D Roth, B Xiang. Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023	21	2023
Interpreting the robustness of neural NLP models to textual perturbations. Y Zhang, L Pan, S Tan, MY Kan. arXiv preprint arXiv:2110.07159, 2021	19	2021
Large language models of code fail at completing code with potential bugs. T Dinh, J Zhao, S Tan, R Negrinho, L Lausen, S Zha, G Karypis. Advances in Neural Information Processing Systems 36, 2024	16	2024
Lessons from the Trenches on Reproducible Evaluation of Language Models. S Biderman, H Schoelkopf, L Sutawika, L Gao, J Tow, B Abbasi, AF Aji, ... arXiv preprint arXiv:2405.14782, 2024	14	2024
The risks of machine learning systems. S Tan, A Taeihagh, K Baxter. arXiv preprint arXiv:2204.09852, 2022	12	2022
Whodunit? Learning to Contrast for Authorship Attribution. B Ai, Y Wang, Y Tan, S Tan. arXiv preprint arXiv:2209.11887, 2022	11	2022
TraVLR: Now You See It, Now You Don’t! A Bimodal Dataset for Evaluating Visio-Linguistic Reasoning. KJ Chow, S Tan, MY Kan. Proceedings of the 17th Conference of the European Chapter of the …, 2023	3	2023
Automatic Feature Fairness in Recommendation via Adversaries. H Hu, Y Cao, Z He, S Tan, MY Kan. Proceedings of the Annual International ACM SIGIR Conference on Research and …, 2023	2	2023
Linguistically-Inclusive Natural Language Processing. S Tan. Ph.D. dissertation, National University of Singapore, 2022	2	2022
