Mohammad Ghavamzadeh

Sitert av

	Alle	Siden 2019
Sitater	14874	11445
h-indeks	59	46
i10-indeks	127	119

2800

1400

700

2100

200620072008200920102011201220132014201520162017201820192020202120222023202450 61 69 100 105 180 230 200 268 303 360 417 582 921 1204 1675 2103 2746 2748

Offentlig tilgang

Vis alle

16 artikler

0 artikler

tilgjengelige

ikke tilgjengelige

Basert på finansieringsmandater

Medforfattere

Yinlam ChowResearch Scientist, Google ResearchVerifisert e-postadresse på google.com
Alessandro LazaricResearch Scientist, Facebook Artificial Intelligence ResearchVerifisert e-postadresse på inria.fr
Shie MannorProfessor of Electrical Engineering @ Technion & Researcher @ NvidiaVerifisert e-postadresse på technion.ac.il
Branislav KvetonAdobe ResearchVerifisert e-postadresse på adobe.com
Sridhar MahadevanDirector, Data Science Lab, Adobe Research & Professor, University of Massachusetts, AmherstVerifisert e-postadresse på cs.umass.edu
Rémi MunosGoogle DeepMindVerifisert e-postadresse på inria.fr
Csaba SzepesvariDeepMind & University of AlbertaVerifisert e-postadresse på cs.ualberta.ca
Georgios TheocharousAdobe ResearchVerifisert e-postadresse på adobe.com
Craig BoutilierPrincipal Scientist, GoogleVerifisert e-postadresse på google.com
Marek PetrikUniversity of New HampshireVerifisert e-postadresse på cs.unh.edu
Amir-massoud FarahmandPolytechnique Montreal, Mila, University of TorontoVerifisert e-postadresse på cs.toronto.edu
Ofir NachumOpenAIVerifisert e-postadresse på openai.com
Philip ThomasUniversity of Massachusetts AmherstVerifisert e-postadresse på cs.umass.edu
Hung BuiResearch Scientist, Google DeepMindVerifisert e-postadresse på google.com
Zheng WenGoogle DeepMindVerifisert e-postadresse på google.com
Shalabh BhatnagarProfessor in the Department of Computer Science and Automation, Indian Institute of ScienceVerifisert e-postadresse på iisc.ac.in
Richard S. SuttonKeen, Amii, and University of AlbertaVerifisert e-postadresse på richsutton.com
Aviv TamarTechnionVerifisert e-postadresse på technion.ac.il
Yonathan EfroniMeta, New YorkVerifisert e-postadresse på fb.com
Manzil ZaheerGoogle ResearchVerifisert e-postadresse på cmu.edu

Følg

Mohammad Ghavamzadeh

Amazon

Verifisert e-postadresse på amazon.com - Startside

Reinforcement Learning Online Learning Machine Learning Control AI


Tittel Sorter etter sitater Sorter etter år Sorter etter tittel	Sitert av Sitert av	År
A Review of Uncertainty Quantification in Deep Learning: Techniques, Applications and Challenges M Abdar, F Pourpanah, S Hussain, D Rezazadegan, L Liu, ... Information Fusion, 2021	2237	2021
Natural Actor–critic Algorithms S Bhatnagar, RS Sutton, M Ghavamzadeh, M Lee Automatica 45 (11), 2471-2482, 2009	1111*	2009
A Lyapunov-based Approach to Safe Reinforcement Learning Y Chow, O Nachum, E Duenez-Guzman, M Ghavamzadeh Neural Information Processing Systems, 8103-8112, 2018	618	2018
Risk-constrained Reinforcement Learning with Percentile Risk Criteria Y Chow, M Ghavamzadeh, L Janson, M Pavone Journal of Machine Learning Research (JMLR) 18, 6070-6120, 2017	603	2017
Bayesian Reinforcement Learning: A Survey M Ghavamzadeh, S Mannor, J Pineau, A Tamar Foundations and Trends in Machine Learning 8 (5-6), 359-483, 2015	575	2015
Algorithms for CVaR Optimization in MDPs Y Chow, M Ghavamzadeh Advances in Neural Information Processing Systems, 3509-3517, 2014	393	2014
Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence V Gabillon, M Ghavamzadeh, A Lazaric Neural Information Processing Systems, 3221-3229, 2012	357	2012
High-confidence Off-policy Evaluation P Thomas, G Theocharous, M Ghavamzadeh AAAI, 3000-3006, 2015	334	2015
Actor-Critic Algorithms for Risk-sensitive MDPs LA Prashanth, M Ghavamzadeh Neural Information Processing Systems, 252-260, 2013	330*	2013
Safe Policy Learning for Continuous Control Y Chow, O Nachum, A Faust, E Duenez-Guzman, M Ghavamzadeh Conference on Robot Learning (CoRL), 2020	297*	2020
More Robust Doubly Robust Off-policy Evaluation M Farajtabar, Y Chow, M Ghavamzadeh ICML, 1447-1456, 2018	287	2018
High Confidence Policy Improvement P Thomas, G Theocharous, M Ghavamzadeh ICML, 2380-2388, 2015	224	2015
Speedy Q-learning M Ghavamzadeh, H Kappen, M Azar, R Munos Neural Information Processing Systems 24, 2411-2419, 2011	222*	2011
Benchmarking Batch Deep Reinforcement Learning Algorithms S Fujimoto, E Conti, M Ghavamzadeh, J Pineau arXiv preprint arXiv:1910.01708, 2019	208	2019
Personalized Ad Recommendation Systems for Life-time Value Optimization with Guarantees G Theocharous, PS Thomas, M Ghavamzadeh IJCAI, 1806-1812, 2015	207*	2015
Supervised actor-critic reinforcement learning MT Rosenstein, AG Barto, J Si, A Barto, W Powell, D Wunsch Learning and approximate dynamic programming: scaling up to the real world …, 2004	199	2004
Hierarchical Multi-agent Reinforcement Learning R Makar, S Mahadevan, M Ghavamzadeh International Conference on Autonomous Agents, 246-253, 2001	197	2001
Aligning text-to-image models using human feedback K Lee, H Liu, M Ryu, O Watkins, Y Du, C Boutilier, P Abbeel, ... arXiv preprint arXiv:2302.12192, 2023	193	2023
Finite-Sample Analysis of Proximal Gradient TD Algorithms B Liu, J Liu, M Ghavamzadeh, S Mahadevan, M Petrik UAI, 504-513, 2015	180*	2015
Hierarchical Multi-agent Reinforcement Learning M Ghavamzadeh, S Mahadevan, R Makar Journal of Autonomous Agents and Multi-Agent Systems (JAAMAS) 13 (2), 197-229, 2006	180	2006

Systemet kan ikke utføre handlingen. Prøv på nytt senere.

Artikler 1–20

Sitater per år

Duplikatsitater

Sammenslåtte sitater

Legg til medforfattereMedforfattere

Følg

Sitert av

Medforfattere