Tadashi Kozuno

Trích dẫn bởi

	Tất cả	Từ 2020
Trích dẫn	472	460
h-index	12	11
i10-index	14	14

160

120

20192020202120222023202420254 12 54 95 110 152 37

Truy cập công khai

Xem tất cả

5 bài viết

0 bài viết

có sẵn

không có sẵn

Dựa trên yêu cầu tài trợ

Đồng tác giả

Rémi MunosFAIR, MetaEmail được xác minh tại inria.fr
Michal ValkoChief Models Officer @ Stealth Startup, Inria & MVA - Ex: Llama at Meta; Gemini and BYOL @ DeepmindEmail được xác minh tại meta.com
Matthieu GeistCohere (ex Google, on leave of Professor, Université de Lorraine)Email được xác minh tại univ-lorraine.fr
Olivier PietquinEarth Species Project | ex Google DeepMind (On leave - Professor at University of Lille)Email được xác minh tại univ-lille.fr
Nino VieillardGoogle DeepMindEmail được xác minh tại google.com
Pierre MénardOvGU MagdeburgEmail được xác minh tại inria.fr
Yunhao TangResearch Scientist, Llama research team; Previously, DeepMindEmail được xác minh tại columbia.edu
Kenji DoyaOkinawa Institute of Science and TechnologyEmail được xác minh tại oist.jp
Wenhao YangStanford UniversityEmail được xác minh tại stanford.edu
Hiroki FurutaThe University of TokyoEmail được xác minh tại weblab.t.u-tokyo.ac.jp
Shixiang Shane GuGoogle DeepMindEmail được xác minh tại google.com
Tatsuya MatsushimaThe University of TokyoEmail được xác minh tại weblab.t.u-tokyo.ac.jp
Yutaka MatsuoProfessor, University of TokyoEmail được xác minh tại weblab.t.u-tokyo.ac.jp
Mark RowlandResearch Scientist, Google DeepMindEmail được xác minh tại google.com
Toshinori KitamuraThe University of TokyoEmail được xác minh tại weblab.t.u-tokyo.ac.jp
Eiji UchibeDept. of Brain Robot Interface, ATR Computational Neuroscience Labs.Email được xác minh tại atr.jp
Martha WhiteUniversity of AlbertaEmail được xác minh tại ualberta.ca
Csaba SzepesvariDeepMind & University of AlbertaEmail được xác minh tại cs.ualberta.ca
Ryo YonetaniSenior Research Scientist at CyberAgentEmail được xác minh tại cyberagent.co.jp
Kenta HoshinoDepartment of Systems Science, Kyoto UniversityEmail được xác minh tại i.kyoto-u.ac.jp

Theo dõi

Tadashi Kozuno

OMRON SINIC X

Email được xác minh tại alumni.oist.jp - Trang chủ

reinforcement learning machine learning neuroscience


Tiêu đề Sắp xếp theo số lượt trích dẫn Sắp xếp theo năm Sắp xếp theo tiêu đề	Trích dẫn bởi Trích dẫn bởi	Năm
Leverage the Average: an Analysis of KL Regularization in Reinforcement Learning N Vieillard, T Kozuno, B Scherrer, O Pietquin, R Munos, M Geist The 34th Conference on Neural Information Processing Systems, 2020	139*	2020
Model-Free Learning for Two-Player Zero-Sum Partially Observable Markov Games with Perfect Recall T Kozuno, P Ménard, R Munos, M Valko Advances in Neural Information Processing Systems 35, 2021	49*	2021
Theoretical analysis of efficiency and robustness of softmax and gap-increasing operators in reinforcement learning T Kozuno, E Uchibe, K Doya The 22nd International Conference on Artificial Intelligence and Statistics …, 2019	44	2019
Greedification operators for policy optimization: Investigating forward and reverse kl divergences A Chan, H Silva, S Lim, T Kozuno, AR Mahmood, M White Journal of Machine Learning Research 23 (253), 1-79, 2022	35	2022
Revisiting Peng's Q () for Modern Reinforcement Learning T Kozuno, Y Tang, M Rowland, R Munos, S Kapturowski, W Dabney, ... The 38th International Conference on Machine Learning, 2021	25	2021
Robust Markov Decision Processes without Model Estimation W Yang, H Wang, T Kozuno, SM Jordan, Z Zhang arXiv preprint arXiv:2302.01248, 2023	21*	2023
Identifying Co-Adaptation of Algorithmic and Implementational Innovations in Deep Reinforcement Learning: A Taxonomy and Case Study of Inference-based Algorithms H Furuta, T Kozuno, T Matsushima, Y Matsuo, SS Gu Advances in Neural Information Processing Systems 35, 2021	21*	2021
Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning H Furuta, T Matsushima, T Kozuno, Y Matsuo, S Levine, O Nachum, ... The 38th International Conference on Machine Learning, 2021	21	2021
Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for Robotics Control with Action Constraints K Kasaura, S Miura, T Kozuno, R Yonetani, K Hoshino, Y Hosoe IEEE Robotics and Automation Letters, 2023	15	2023
Adapting to game trees in zero-sum imperfect information games C Fiegel, P Ménard, T Kozuno, R Munos, V Perchet, M Valko International Conference on Machine Learning, 10093-10135, 2023	14	2023
Confident Approximate Policy Iteration for Efficient Local Planning in -realizable MDPs G Weisz, A György, T Kozuno, C Szepesvári Advances in Neural Information Processing Systems 35, 25547-25559, 2022	12	2022
Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation Y Tang, T Kozuno, M Rowland, R Munos, M Valko Advances in Neural Information Processing Systems 35, 2021	12	2021
No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL H Wang, A Sakhadeo, A White, J Bell, V Liu, X Zhao, P Liu, T Kozuno, ... Transactions on Machine Learning Research, 2022	11	2022
KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal T Kozuno, W Yang, N Vieillard, T Kitamura, Y Tang, J Mei, P Ménard, ... arXiv preprint arXiv:2205.14211, 2022	10	2022
Symmetry-aware reinforcement learning for robotic assembly under partial observability with a soft wrist H Nguyen, T Kozuno, CC Beltran-Hernandez, M Hamaya 2024 IEEE international conference on robotics and automation (ICRA), 9369-9375, 2024	9	2024
Variational oracle guiding for reinforcement learning D Han, T Kozuno, X Luo, ZY Chen, K Doya, Y Yang, D Li International Conference on Learning Representations, 2021	9	2021
Study of White-LED Using Amorphous Carbon Nitride Grown by RF-sputtering and ECR-plasma CVD T Kozuno, S Kishimoto, K Tachibana, K Itoh, Y Iwano, S Kunitsugu, ... Journal of Light & Visual Environment 35 (1), 86-89, 2011	7	2011
When to replan? an adaptive replanning strategy for autonomous navigation using deep reinforcement learning K Honda, R Yonetani, M Nishimura, T Kozuno 2024 IEEE International Conference on Robotics and Automation (ICRA), 6650-6656, 2024	3	2024
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees T Kitamura, T Kozuno, M Kato, Y Ichihara, S Nishimori, A Sannai, ... arXiv preprint arXiv:2401.17780, 2024	3	2024
Gap-Increasing Policy Evaluation for Efficient and Noise-Tolerant Reinforcement Learning T Kozuno, D Han, K Doya arXiv preprint arXiv:1906.07586, 2019	3	2019

Hệ thống không thể thực hiện thao tác ngay bây giờ. Hãy thử lại sau.

Bài viết 1–20

Trích dẫn mỗi năm

Trích dẫn trùng lặp

Trích dẫn được hợp nhất

Thêm đồng tác giảĐồng tác giả

Theo dõi

Trích dẫn bởi

Đồng tác giả