Mohammad Ghavamzadeh

Cited by

	All	Since 2020
Citations	15779	11431
h-index	58	46
i10-index	128	119

Citations per year

Year	Citations
2006	50
2007	60
2008	69
2009	100
2010	107
2011	183
2012	228
2013	198
2014	271
2015	301
2016	361
2017	416
2018	584
2019	915
2020	1200
2021	1674
2022	2116
2023	2829
2024	3088
2025	511

Public access

20 articles available, 0 articles not available (based on funding mandates)

Co-authors

Yinlam Chow, Research Scientist, Google Research (verified email at google.com)
Alessandro Lazaric, Research Scientist, Facebook Artificial Intelligence Research (verified email at inria.fr)
Shie Mannor, Professor of Electrical Engineering @ Technion & Researcher @ Nvidia (verified email at technion.ac.il)
Branislav Kveton, Adobe Research (verified email at adobe.com)
Sridhar Mahadevan, Director, Adobe Research & Professor, University of Massachusetts, Amherst (verified email at cs.umass.edu)
Rémi Munos, FAIR, Meta (verified email at inria.fr)
Csaba Szepesvari, DeepMind & University of Alberta (verified email at cs.ualberta.ca)
Craig Boutilier, Principal Scientist, Google (verified email at google.com)
Georgios Theocharous, Adobe Research (verified email at adobe.com)
Marek Petrik, University of New Hampshire (verified email at cs.unh.edu)
Amir-massoud Farahmand, Polytechnique Montreal, Mila, University of Toronto (verified email at cs.toronto.edu)
Ofir Nachum, OpenAI (verified email at openai.com)
Philip Thomas, University of Massachusetts Amherst (verified email at cs.umass.edu)
Hung Bui, Research Scientist, Google DeepMind (verified email at google.com)
Zheng Wen, Google DeepMind (verified email at google.com)
Shalabh Bhatnagar, Professor in the Department of Computer Science and Automation, Indian Institute of Science (verified email at iisc.ac.in)
Richard S. Sutton, Keen, Amii, and University of Alberta (verified email at richsutton.com)
Yasin Abbasi Yadkori, Google DeepMind (verified email at google.com)
Aviv Tamar, Technion (verified email at technion.ac.il)
Yonathan Efroni, Meta, New York (verified email at fb.com)


Mohammad Ghavamzadeh

Amazon AGI

Verified email at amazon.com

Reinforcement Learning, Online Learning, Machine Learning, Control, AI


Title	Authors	Venue	Cited by	Year
A Review of Uncertainty Quantification in Deep Learning: Techniques, Applications and Challenges	M Abdar, F Pourpanah, S Hussain, D Rezazadegan, L Liu, ...	Information Fusion, 2021	2527	2021
Natural Actor–critic Algorithms	S Bhatnagar, RS Sutton, M Ghavamzadeh, M Lee	Automatica 45 (11), 2471-2482, 2009	1133*	2009
Risk-constrained Reinforcement Learning with Percentile Risk Criteria	Y Chow, M Ghavamzadeh, L Janson, M Pavone	Journal of Machine Learning Research (JMLR) 18, 6070-6120, 2017	650	2017
A Lyapunov-based Approach to Safe Reinforcement Learning	Y Chow, O Nachum, E Duenez-Guzman, M Ghavamzadeh	Neural Information Processing Systems, 8103-8112, 2018	633	2018
Bayesian Reinforcement Learning: A Survey	M Ghavamzadeh, S Mannor, J Pineau, A Tamar	Foundations and Trends in Machine Learning 8 (5-6), 359-483, 2015	596	2015
Algorithms for CVaR Optimization in MDPs	Y Chow, M Ghavamzadeh	Advances in Neural Information Processing Systems, 3509-3517, 2014	411	2014
Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence	V Gabillon, M Ghavamzadeh, A Lazaric	Neural Information Processing Systems, 3221-3229, 2012	379	2012
High-confidence Off-policy Evaluation	P Thomas, G Theocharous, M Ghavamzadeh	AAAI, 3000-3006, 2015	346	2015
Actor-Critic Algorithms for Risk-sensitive MDPs	LA Prashanth, M Ghavamzadeh	Neural Information Processing Systems, 252-260, 2013	339*	2013
Safe Policy Learning for Continuous Control	Y Chow, O Nachum, A Faust, E Duenez-Guzman, M Ghavamzadeh	Conference on Robot Learning (CoRL), 2020	314*	2020
More Robust Doubly Robust Off-policy Evaluation	M Farajtabar, Y Chow, M Ghavamzadeh	ICML, 1447-1456, 2018	304	2018
Aligning text-to-image models using human feedback	K Lee, H Liu, M Ryu, O Watkins, Y Du, C Boutilier, P Abbeel, ...	arXiv preprint arXiv:2302.12192, 2023	265	2023
High Confidence Policy Improvement	P Thomas, G Theocharous, M Ghavamzadeh	ICML, 2380-2388, 2015	229	2015
Benchmarking Batch Deep Reinforcement Learning Algorithms	S Fujimoto, E Conti, M Ghavamzadeh, J Pineau	arXiv preprint arXiv:1910.01708, 2019	222	2019
Speedy Q-learning	M Ghavamzadeh, H Kappen, M Azar, R Munos	Neural Information Processing Systems 24, 2411-2419, 2011	221*	2011
Personalized Ad Recommendation Systems for Life-time Value Optimization with Guarantees	G Theocharous, PS Thomas, M Ghavamzadeh	IJCAI, 1806-1812, 2015	209*	2015
Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models	Y Fan, O Watkins, Y Du, H Liu, M Ryu, C Boutilier, P Abbeel, ...	Advances in Neural Information Processing Systems, 2024	204*	2024
Supervised actor-critic reinforcement learning	MT Rosenstein, AG Barto, J Si, A Barto, W Powell, D Wunsch	Learning and approximate dynamic programming: scaling up to the real world …, 2004	203	2004
Hierarchical Multi-agent Reinforcement Learning	R Makar, S Mahadevan, M Ghavamzadeh	International Conference on Autonomous Agents, 246-253, 2001	195	2001
Hierarchical Multi-agent Reinforcement Learning	M Ghavamzadeh, S Mahadevan, R Makar	Journal of Autonomous Agents and Multi-Agent Systems (JAAMAS) 13 (2), 197-229, 2006	183	2006
