Abbas Abdolmaleki

Cited by

	All	Since 2019
Citations	4489	4237
h-index	28	26
i10-index	46	39

1200

600

300

900

2014201520162017201820192020202120222023202414 28 29 62 81 178 387 547 906 1104 1091

Public access

View all

13 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Martin RiedmillerDeepMindVerified email at google.com
Nicolas HeessDeepMindVerified email at google.com
Michael NeunertGoogle DeepMindVerified email at google.com
Thomas LampeDeepMindVerified email at google.com
Luis Paulo ReisAssociate Professor, University of PortoVerified email at fe.up.pt
Nuno LauUniversidade de AveiroVerified email at ua.pt
Yuval TassaSenior Research Scientist, Google DeepMindVerified email at google.com
Roland HafnerDeepMindVerified email at google.com
Gerhard NeumannProfessor, Karlsruhe Institute of Technology (KIT)Verified email at robot-learning.de
Noah Y. SiegelGoogle DeepMindVerified email at google.com
Josh MerelVerified email at google.com
Steven BohezGoogle DeepMindVerified email at google.com
Nima ShafiiNVIDIAVerified email at nvidia.com
Jan PetersProfessor for Intelligent Autonomous Systems/TU Darmstadt, Dept. Head/German AI Research Center DFKIVerified email at ias.tu-darmstadt.de
Rudolf LioutikovTT-Professor, Intuitive Robots Lab, Karlsruhe Institute of TechnologyVerified email at kit.edu
Jost Tobias SpringenbergGoogle DeepMind

Abbas Abdolmaleki

Deepmind

Verified email at google.com

Artificial Intelligence Reinforcement Learning Robotics


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Magnetic control of tokamak plasmas through deep reinforcement learning J Degrave, F Felici, J Buchli, M Neunert, B Tracey, F Carpanese, T Ewalds, ... Nature 602 (7897), 414-419, 2022	851	2022
Deepmind control suite Y Tassa, Y Doron, A Muldal, T Erez, Y Li, DL Casas, D Budden, ... arXiv preprint arXiv:1801.00690, 2018	640	2018
Maximum a posteriori policy optimisation A Abdolmaleki, JT Springenberg, Y Tassa, R Munos, N Heess, ... arXiv preprint arXiv:1806.06920, 2018	530	2018
Keep doing what worked: Behavioral modelling priors for offline reinforcement learning NY Siegel, JT Springenberg, F Berkenkamp, A Abdolmaleki, M Neunert, ... arXiv preprint arXiv:2002.08396, 2020	309	2020
Acme: A research framework for distributed reinforcement learning MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2020	265	2020
Robust reinforcement learning for continuous control with model misspecification DJ Mankowitz, N Levine, R Jeong, Y Shi, J Kay, A Abdolmaleki, ... arXiv preprint arXiv:1906.07516, 2019	126	2019
V-mpo: On-policy maximum a posteriori policy optimization for discrete and continuous control HF Song, A Abdolmaleki, JT Springenberg, A Clark, H Soyer, JW Rae, ... arXiv preprint arXiv:1909.12238, 2019	119	2019
From motor control to team play in simulated humanoid football S Liu, G Lever, Z Wang, J Merel, SMA Eslami, D Hennes, WM Czarnecki, ... Science Robotics 7 (69), eabo0235, 2022	118	2022
Continuous-discrete reinforcement learning for hybrid control in robotics M Neunert, A Abdolmaleki, M Wulfmeier, T Lampe, T Springenberg, ... Conference on Robot Learning, 735-751, 2020	104	2020
Beyond pick-and-place: Tackling robotic stacking of diverse shapes AX Lee, CM Devin, Y Zhou, T Lampe, K Bousmalis, JT Springenberg, ... 5th Annual Conference on Robot Learning, 2021	101	2021
Model-based relative entropy stochastic search A Abdolmaleki, R Lioutikov, JR Peters, N Lau, L Pualo Reis, G Neumann Advances in Neural Information Processing Systems 28, 2015	98	2015
A distributional view on multi-objective policy optimization A Abdolmaleki, S Huang, L Hasenclever, M Neunert, F Song, M Zambelli, ... International conference on machine learning, 11-22, 2020	85	2020
Robocat: A self-improving foundation agent for robotic manipulation K Bousmalis, G Vezzani, D Rao, C Devin, AX Lee, M Bauza, T Davchev, ... arXiv preprint arXiv:2306.11706, 2023	84	2023
Value constrained model-free continuous control S Bohez, A Abdolmaleki, M Neunert, J Buchli, N Heess, R Hadsell arXiv preprint arXiv:1902.04623, 2019	72	2019
Relative entropy regularized policy iteration A Abdolmaleki, JT Springenberg, J Degrave, S Bohez, Y Tassa, D Belov, ... arXiv preprint arXiv:1812.02256, 2018	71	2018
Model-free trajectory optimization for reinforcement learning R Akrour, G Neumann, H Abdulsamad, A Abdolmaleki International Conference on Machine Learning, 2961-2970, 2016	52	2016
Data-efficient hindsight off-policy option learning M Wulfmeier, D Rao, R Hafner, T Lampe, A Abdolmaleki, T Hertweck, ... International Conference on Machine Learning, 11340-11350, 2021	50	2021
Imagined value gradients: Model-based policy optimization with tranferable latent dynamics models A Byravan, JT Springenberg, A Abdolmaleki, R Hafner, M Neunert, ... Conference on Robot Learning, 566-589, 2020	47	2020
Deriving and improving cma-es with information geometric trust regions A Abdolmaleki, B Price, N Lau, LP Reis, G Neumann Proceedings of the Genetic and Evolutionary Computation Conference, 657-664, 2017	41	2017
An optimized gait generator based on fourier series towards fast and robust biped locomotion involving arms swing N Shafii, A Khorsandian, A Abdolmaleki, B Jozi 2009 IEEE International Conference on Automation and Logistics, 2018-2023, 2009	41	2009

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors