Yaodong Yang

Cited by

	All	Since 2019
Citations	6869	6745
h-index	39	39
i10-index	82	82

Citations per year
Year	2017	2018	2019	2020	2021	2022	2023	2024
Citations	30	81	173	322	553	902	1562	3195

Public access

30 articles available, 0 articles not available (based on funding mandates)

Co-authors

Jun Wang - Professor, Computer Science, University College London - Verified email at cs.ucl.ac.uk
Ying Wen - Associate Professor, Shanghai Jiao Tong University - Verified email at sjtu.edu.cn
Jiaming Ji (吉嘉铭) - Peking University - Verified email at stu.pku.edu.cn
Weinan Zhang - Professor, Shanghai Jiao Tong University - Verified email at sjtu.edu.cn
David Mguni - Lecturer, Computer Science, Queen Mary University of London - Verified email at qmul.ac.uk
Josef Dai - Zhejiang University - Verified email at zju.edu.cn
Stephen McAleer - OpenAI - Verified email at openai.com
Jakub Grudzien Kuba - UC Berkeley - Verified email at berkeley.edu
Yuanpei Chen - South China University of Technology - Verified email at stanford.edu
Yiran Geng - Turing Class, Peking University - Verified email at stu.pku.edu.cn
Nicolas Perez-Nieves - Research Engineer, DeepMind - Verified email at google.com
Haitham Bou-Ammar - RL-Team Leader, BO-Team Leader, MAS-Team Leader @ Huawei London & H. Assistant Professor @ UCL - Verified email at huawei.com
Xiaotie Deng - Chair Professor of Computer Science, Peking University, Beijing, China - Verified email at pku.edu.cn
Jieping Ye, IEEE Fellow & ACM Distin... - Alibaba Group - Verified email at umich.edu
Matthew E. Taylor - Professor, University of Alberta - Verified email at ualberta.ca

Yaodong Yang

BOYA (博雅) Assistant Professor at Peking University

Verified email at pku.edu.cn - Homepage

Reinforcement Learning, AI Alignment, Embodied AI, Multi-Agent Learning


Title	Cited by	Year
Mean field multi-agent reinforcement learning - Y Yang, R Luo, M Li, M Zhou, W Zhang, J Wang - ICML 2018, Long Talk, 5571-5580, 2018	817	2018
Multiagent bidirectionally-coordinated nets: Emergence of human-level coordination in learning to play StarCraft combat games - P Peng, Y Wen, Y Yang, Q Yuan, Z Tang, H Long, J Wang - NeurIPS 2017 Workshop: Emergent Communication, 2017	608	2017
Baichuan 2: Open Large-scale Language Models - A Yang, B Xiao, B Wang, B Zhang, C Yin, C Lv, D Pan, D Wang, D Yan, ... - arXiv preprint arXiv:2309.10305, 2023	445*	2023
An Overview of Multi-Agent Reinforcement Learning from Game Theoretical Perspective - Y Yang, J Wang - arXiv preprint arXiv:2011.00583, 2020	332	2020
Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning - M Li, Y Jiao, T Qin, Y Yang, Z Gong, J Wang, C Wang, G Wu, J Ye - WWW 2019 (oral), 2019	307	2019
A Review of Safe Reinforcement Learning: Methods, Theory and Applications - S Gu, L Yang, Y Du, G Chen, F Walter, J Wang, Y Yang, A Knoll - arXiv preprint arXiv:2205.10330, 2022	273	2022
BeaverTails: Towards improved safety alignment of LLM via a human-preference dataset - J Ji, M Liu, J Dai, X Pan, C Zhang, C Bian, R Sun, Y Wang, Y Yang - NeurIPS 2023, 2023	251	2023
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning - JG Kuba, R Chen, M Wen, Y Wen, F Sun, J Wang, Y Yang - ICLR 2022, 2021	246	2021
SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving - M Zhou, J Luo, J Villela, Y Yang, D Rusu, J Miao, W Zhang, M Alban, ... - Conference on Robot Learning 2020 (Best System Paper Award), 2020	215*	2020
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem - M Wen, JG Kuba, R Lin, W Zhang, Y Wen, J Wang, Y Yang - NeurIPS 2022, 2022	179	2022
AI alignment: A comprehensive survey - J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang, Y Duan, Z He, J Zhou, ... - arXiv preprint arXiv:2310.19852, 2023	177	2023
Safe RLHF: Safe Reinforcement Learning from Human Feedback - J Dai, X Pan, R Sun, J Ji, X Xu, M Liu, Y Wang, Y Yang - arXiv preprint arXiv:2310.12773, 2023	174	2023
Probabilistic Recursive Reasoning for Multi-Agent Reinforcement Learning - Y Wen, Y Yang, R Luo, J Wang, W Pan - ICLR 2019, 2019	173	2019
Can deep learning predict risky retail investors? A case study in financial risk behavior forecasting - A Kim, Y Yang, S Lessmann, T Ma, MC Sung, JEV Johnson - European Journal of Operational Research 283 (1), 217-234, 2020	129	2020
Bi-level Actor-Critic for Multi-agent Coordination - H Zhang, W Chen, Z Huang, M Li, Y Yang, W Zhang, J Wang - AAAI 2020, 2019	96	2019
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning - Y Chen, Y Yang, T Wu, S Wang, X Feng, J Jiang, SM McAleer, H Dong, ... - NeurIPS 2022, 2022	94	2022
Multi-Agent Determinantal Q-Learning - Y Yang, Y Wen, L Chen, J Wang, K Shao, D Mguni, W Zhang - ICML 2020, 2020	80	2020
Factorized Q-learning for large-scale multi-agent systems - M Zhou, Y Chen, Y Wen, Y Yang, Y Su, W Zhang, D Zhang, J Wang - International Conference on Distributed Artificial Intelligence, 1-7, 2019	77	2019
Offline Pre-trained Multi-agent Decision Transformer - L Meng, M Wen, C Le, X Li, D Xing, W Zhang, Y Wen, H Zhang, J Wang, ... - Machine Intelligence Research 20 (2), 233-248, 2023	72	2023
Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning - Y Wen, Y Yang, R Luo, J Wang - IJCAI 2020, 2019	71	2019


Articles 1–20
