Tadashi Kozuno
OMRON SINIC X
Verified email at alumni.oist.jp
Title
Cited by
Year
Leverage the Average: an Analysis of KL Regularization in Reinforcement Learning
N Vieillard, T Kozuno, B Scherrer, O Pietquin, R Munos, M Geist
The 34th Conference on Neural Information Processing Systems, 2020
Cited by: 139* · Year: 2020
Model-Free Learning for Two-Player Zero-Sum Partially Observable Markov Games with Perfect Recall
T Kozuno, P Ménard, R Munos, M Valko
Advances in Neural Information Processing Systems 35, 2021
Cited by: 49* · Year: 2021
Theoretical analysis of efficiency and robustness of softmax and gap-increasing operators in reinforcement learning
T Kozuno, E Uchibe, K Doya
The 22nd International Conference on Artificial Intelligence and Statistics …, 2019
Cited by: 44 · Year: 2019
Greedification operators for policy optimization: Investigating forward and reverse KL divergences
A Chan, H Silva, S Lim, T Kozuno, AR Mahmood, M White
Journal of Machine Learning Research 23 (253), 1-79, 2022
Cited by: 35 · Year: 2022
Revisiting Peng's Q(λ) for Modern Reinforcement Learning
T Kozuno, Y Tang, M Rowland, R Munos, S Kapturowski, W Dabney, ...
The 38th International Conference on Machine Learning, 2021
Cited by: 25 · Year: 2021
Robust Markov Decision Processes without Model Estimation
W Yang, H Wang, T Kozuno, SM Jordan, Z Zhang
arXiv preprint arXiv:2302.01248, 2023
Cited by: 21* · Year: 2023
Identifying Co-Adaptation of Algorithmic and Implementational Innovations in Deep Reinforcement Learning: A Taxonomy and Case Study of Inference-based Algorithms
H Furuta, T Kozuno, T Matsushima, Y Matsuo, SS Gu
Advances in Neural Information Processing Systems 35, 2021
Cited by: 21* · Year: 2021
Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning
H Furuta, T Matsushima, T Kozuno, Y Matsuo, S Levine, O Nachum, ...
The 38th International Conference on Machine Learning, 2021
Cited by: 21 · Year: 2021
Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for Robotics Control with Action Constraints
K Kasaura, S Miura, T Kozuno, R Yonetani, K Hoshino, Y Hosoe
IEEE Robotics and Automation Letters, 2023
Cited by: 15 · Year: 2023
Adapting to game trees in zero-sum imperfect information games
C Fiegel, P Ménard, T Kozuno, R Munos, V Perchet, M Valko
International Conference on Machine Learning, 10093-10135, 2023
Cited by: 14 · Year: 2023
Confident Approximate Policy Iteration for Efficient Local Planning in q^π-realizable MDPs
G Weisz, A György, T Kozuno, C Szepesvári
Advances in Neural Information Processing Systems 35, 25547-25559, 2022
Cited by: 12 · Year: 2022
Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation
Y Tang*, T Kozuno*, M Rowland, R Munos, M Valko
Advances in Neural Information Processing Systems 35, 2021
Cited by: 12 · Year: 2021
No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL
H Wang, A Sakhadeo, A White, J Bell, V Liu, X Zhao, P Liu, T Kozuno, ...
Transactions on Machine Learning Research, 2022
Cited by: 11 · Year: 2022
KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal
T Kozuno, W Yang, N Vieillard, T Kitamura, Y Tang, J Mei, P Ménard, ...
arXiv preprint arXiv:2205.14211, 2022
Cited by: 10 · Year: 2022
Symmetry-aware reinforcement learning for robotic assembly under partial observability with a soft wrist
H Nguyen, T Kozuno, CC Beltran-Hernandez, M Hamaya
2024 IEEE International Conference on Robotics and Automation (ICRA), 9369-9375, 2024
Cited by: 9 · Year: 2024
Variational oracle guiding for reinforcement learning
D Han, T Kozuno, X Luo, ZY Chen, K Doya, Y Yang, D Li
International Conference on Learning Representations, 2021
Cited by: 9 · Year: 2021
Study of White-LED Using Amorphous Carbon Nitride Grown by RF-sputtering and ECR-plasma CVD
T Kozuno, S Kishimoto, K Tachibana, K Itoh, Y Iwano, S Kunitsugu, ...
Journal of Light & Visual Environment 35 (1), 86-89, 2011
Cited by: 7 · Year: 2011
When to replan? An adaptive replanning strategy for autonomous navigation using deep reinforcement learning
K Honda, R Yonetani, M Nishimura, T Kozuno
2024 IEEE International Conference on Robotics and Automation (ICRA), 6650-6656, 2024
Cited by: 3 · Year: 2024
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees
T Kitamura, T Kozuno, M Kato, Y Ichihara, S Nishimori, A Sannai, ...
arXiv preprint arXiv:2401.17780, 2024
Cited by: 3 · Year: 2024
Gap-Increasing Policy Evaluation for Efficient and Noise-Tolerant Reinforcement Learning
T Kozuno, D Han, K Doya
arXiv preprint arXiv:1906.07586, 2019
Cited by: 3 · Year: 2019