Artikel mit Open-Access-Mandaten - Mohammad GhavamzadehWeitere Informationen
Verfügbar: 20
A Review of Uncertainty Quantification in Deep Learning: Techniques, Applications and Challenges
M Abdar, F Pourpanah, S Hussain, D Rezazadegan, L Liu, ...
Information Fusion, 2021
Mandate: Australian Research Council, Natural Sciences and Engineering Research …
Risk-constrained Reinforcement Learning with Percentile Risk Criteria
Y Chow, M Ghavamzadeh, L Janson, M Pavone
Journal of Machine Learning Research (JMLR) 18, 6070-6120, 2017
Mandate: US Department of Defense, US National Institutes of Health
Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
Y Fan, O Watkins, Y Du, H Liu, M Ryu, C Boutilier, P Abbeel, ...
Advances in Neural Information Processing Systems, 2024
Mandate: US National Science Foundation
Policy Gradient for Coherent Risk Measures
A Tamar, Y Chow, M Ghavamzadeh, S Mannor
Advances in Neural Information Processing Systems, 1468-1476, 2015
Mandate: European Commission
Model-independent Online Learning for Influence Maximization
S Vaswani, B Kveton, Z Wen, M Ghavamzadeh, LVS Lakshmanan, ...
ICML, 3530-3539, 2017
Mandate: Natural Sciences and Engineering Research Council of Canada
Robust reinforcement learning using offline data
K Panaganti, Z Xu, D Kalathil, M Ghavamzadeh
Advances in neural information processing systems 35, 32211-32224, 2022
Mandate: US National Science Foundation
Variance-constrained Actor-Critic Algorithms for Discounted and Average Reward MDPs
LA Prashanth, M Ghavamzadeh
Machine Learning 105 (3), 367-417, 2016
Mandate: US National Science Foundation
Improved Learning Complexity in Combinatorial Pure Exploration Bandits
V Gabillon, A Lazaric, M Ghavamzadeh, R Ortner, P Bartlett
AISTATS, 1004-1012, 2016
Mandate: Fonds zur Förderung der wissenschaftlichen Forschung, Australian Research …
A Block Coordinate Ascent Algorithm for Mean-Variance Optimization
T Xie, B Liu, Y Xu, M Ghavamzadeh, Y Chow, D Lyu, D Yoon
Advances in Neural Information Processing Systems, 1073-1083, 2018
Mandate: US National Science Foundation
A Review of Deep Learning for Video Captioning
M Abdar, M Kollati, S Kuraparthi, F Pourpanah, D McDuff, ...
arXiv preprint arXiv:2304.11431, 2023
Mandate: Australian Research Council
Entropic Risk Optimization in Discounted MDPs
JL Hau, M Petrik, M Ghavamzadeh
International Conference on Artificial Intelligence and Statistics, 47-76, 2023
Mandate: US National Science Foundation
Proximal Gradient Temporal Difference Learning: Stable Reinforcement Learning with Polynomial Sample Complexity
B Liu, I Gemp, M Ghavamzadeh, J Liu, S Mahadevan, M Petrik
Journal of Artificial Intelligence Research (JAIR) 63, 461-494, 2018
Mandate: US National Science Foundation
Feature and parameter selection in stochastic linear bandits
A Moradipari, B Turan, Y Abbasi-Yadkori, M Alizadeh, M Ghavamzadeh
International Conference on Machine Learning, 15927-15958, 2022
Mandate: US National Science Foundation
Collaborative Multi-agent Stochastic Linear Bandits
A Moradipari, M Ghavamzadeh, M Alizadeh
2022 American Control Conference (ACC), 2761-2766, 2022
Mandate: US National Science Foundation
On Dynamic Programming Decompositions of Static Risk Measures in Markov Decision Processes
JL Hau, E Delage, M Ghavamzadeh, M Petrik
Advances in Neural Information Processing Systems 36, 2024
Mandate: US National Science Foundation, Natural Sciences and Engineering Research …
Operator Splitting Value Iteration
A Rakhsha, A Wang, M Ghavamzadeh, A Farahmand
Advances in Neural Information Processing Systems 35, 38373-38385, 2022
Mandate: Natural Sciences and Engineering Research Council of Canada
Multi-environment meta-learning in stochastic linear bandits
A Moradipari, M Ghavamzadeh, T Rajabzadeh, C Thrampoulidis, ...
2022 IEEE International Symposium on Information Theory (ISIT), 1659-1664, 2022
Mandate: US National Science Foundation
Ordering-based Conditions for Global Convergence of Policy Gradient Methods
J Mei, B Dai, A Agarwal, M Ghavamzadeh, C Szepesvári, D Schuurmans
Advances in Neural Information Processing Systems 36, 2024
Mandate: Natural Sciences and Engineering Research Council of Canada
Distributionally robust behavioral cloning for robust imitation learning
K Panaganti, Z Xu, D Kalathil, M Ghavamzadeh
2023 62nd IEEE Conference on Decision and Control (CDC), 1342-1347, 2023
Mandate: US National Science Foundation
On Dynamic Program Decompositions of Static Risk Measures
JL Hau, E Delage, M Ghavamzadeh, M Petrik
Les Cahiers du GERAD ISSN 711, 2440, 2023
Mandate: Fonds de recherche du Québec - Nature et technologies
Angaben zur Publikation und Finanzierung werden automatisch von einem Computerprogramm ermittelt