Segui
Matthieu Zimmer
Matthieu Zimmer
RL Research Scientist @ Huawei Noah’s Ark Lab
Email verificata su matthieu-zimmer.net - Home page
Titolo
Citata da
Citata da
Anno
A survey on interpretable reinforcement learning
C Glanois, P Weng, M Zimmer, D Li, T Yang, J Hao, W Liu
Machine Learning, 1-44, 2024
952024
Learning fair policies in multi-objective (deep) reinforcement learning with average and discounted rewards
U Siddique, P Weng, M Zimmer
International Conference on Machine Learning, 8905-8915, 2020
952020
Teacher-student framework: a reinforcement learning approach
M Zimmer, P Viappiani, P Weng
AAMAS Workshop Autonomous Robots and Multirobot Systems, 2014
862014
Learning fair policies in decentralized cooperative multi-agent reinforcement learning
M Zimmer, C Glanois, U Siddique, P Weng
International Conference on Machine Learning, 12967-12978, 2021
622021
Invariant transform experience replay: Data augmentation for deep reinforcement learning
Y Lin, J Huang, M Zimmer, Y Guan, J Rojas, P Weng
IEEE Robotics and Automation Letters 5 (4), 6615-6622, 2020
432020
Bootstrapping Q-Learning for Robotics from Neuro-Evolution Results
M Zimmer, S Doncieux
IEEE Transactions on Cognitive and Developmental Systems, 2017
312017
Neuro-symbolic hierarchical rule induction
C Glanois, Z Jiang, X Feng, P Weng, M Zimmer, D Li, W Liu, J Hao
International Conference on Machine Learning, 7583-7615, 2022
282022
Differentiable logic machines
M Zimmer, X Feng, C Glanois, Z Jiang, J Zhang, P Weng, D Li, J Hao, ...
arXiv preprint arXiv:2102.11529, 2021
252021
Pangu-agent: A fine-tunable generalist agent with structured reasoning
F Christianos, G Papoudakis, M Zimmer, T Coste, Z Wu, J Chen, ...
arXiv preprint arXiv:2312.14878, 2023
152023
Neural Fitted Actor-Critic
M Zimmer, Y Boniface, A Dutech
ESANN - European Symposium on Artificial Neural Networks, Computational …, 2016
152016
Developmental reinforcement learning through sensorimotor space enlargement
M Zimmer, Y Boniface, A Dutech
International Conference on Development and Learning and on Epigenetic Robotics, 2018
142018
End-to-end meta-bayesian optimisation with transformer neural processes
A Maraval, M Zimmer, A Grosnit, H Bou Ammar
Advances in Neural Information Processing Systems 36, 2024
132024
Exploiting the sign of the advantage function to learn deterministic policies in continuous domains
M Zimmer, P Weng
International Joint Conference on Artificial Intelligence, 2019
122019
Apprentissage par renforcement développemental
M Zimmer
Université de Lorraine, 2018
122018
Hyperparameter auto-tuning in self-supervised robotic learning
J Huang, J Rojas, M Zimmer, H Wu, Y Guan, P Weng
IEEE Robotics and Automation Letters 6 (2), 3537-3544, 2021
92021
ROS-LLM: A ROS framework for embodied AI with task feedback and structured reasoning
CE Mower, Y Wan, H Yu, A Grosnit, J Gonzalez-Billandon, M Zimmer, ...
arXiv preprint arXiv:2406.19741, 2024
52024
Towards More Sample Efficiency inReinforcement Learning with Data Augmentation
Y Lin, J Huang, M Zimmer, J Rojas, P Weng
Robot Learning Workshop, NeurIPS 2019, 2019
52019
Lightweight Structural Choices Operator for Technology Mapping
A Grosnit, M Zimmer, R Tutunov, X Li, L Chen, F Yang, M Yuan, ...
2023 60th ACM/IEEE Design Automation Conference (DAC), 1-6, 2023
22023
Off-Policy Neural Fitted Actor-Critic
M Zimmer, Y Boniface, A Dutech
Deep Reinforcement Learning Workshop, NIPS 2016, 2016
22016
Automatic Unit Test Data Generation and Actor-Critic Reinforcement Learning for Code Synthesis
PJ Gorinski, M Zimmer, G Lampouras, DGX Deik, I Iacobacci
arXiv preprint arXiv:2310.13669, 2023
12023
Il sistema al momento non può eseguire l'operazione. Riprova più tardi.
Articoli 1–20