팔로우
Wesley Chung
Wesley Chung
mail.mcgill.ca의 이메일 확인됨
제목
인용
인용
연도
Two-timescale networks for nonlinear value function approximation
W Chung
522019
Importance resampling for off-policy prediction
M Schlegel, W Chung, D Graves, J Qian, M White
Advances in Neural Information Processing Systems 32, 2019
432019
Beyond variance reduction: Understanding the true impact of baselines on policy optimization
W Chung, V Thomas, MC Machado, N Le Roux
International Conference on Machine Learning, 1999-2009, 2021
302021
The role of baselines in policy gradient optimization
J Mei, W Chung, V Thomas, B Dai, C Szepesvari, D Schuurmans
Advances in Neural Information Processing Systems 35, 17818-17830, 2022
182022
High-confidence error estimates for learned value functions
T Sajed, W Chung, M White
arXiv preprint arXiv:1808.09127, 2018
82018
Incrementally Learning Functions of the Return
B Bennett, W Chung, M Zaheer, V Liu
arXiv preprint arXiv:1907.04651, 2019
12019
Parseval Regularization for Continual Reinforcement Learning
W Chung, L Cherif, D Precup, D Meger
The Thirty-eighth Annual Conference on Neural Information Processing Systems, 0
Offline-Online Reinforcement Learning: Extending Batch and Online RL
M Hashemzadeh, W Chung, M White
Importance Resampling for Off-policy Policy Evaluation
M Schlegel, W Chung, D Graves, M White
현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.
학술자료 1–9