Denis Tarasov
Title
Cited by
Year
CORL: Research-oriented deep offline reinforcement learning library
D Tarasov, A Nikulin, D Akimov, V Kurenkov, S Kolesnikov
Advances in Neural Information Processing Systems 36, 2024
Cited by 76, 2024
Anti-exploration by random network distillation
A Nikulin, V Kurenkov, D Tarasov, S Kolesnikov
International Conference on Machine Learning, 26228-26244, 2023
Cited by 24, 2023
Revisiting the minimalist approach to offline reinforcement learning
D Tarasov, V Kurenkov, A Nikulin, S Kolesnikov
Advances in Neural Information Processing Systems 36, 2024
Cited by 22, 2024
Q-ensemble for offline RL: Don't scale the ensemble, scale the batch size
A Nikulin, V Kurenkov, D Tarasov, D Akimov, S Kolesnikov
arXiv preprint arXiv:2211.11092, 2022
Cited by 17, 2022
Let offline RL flow: Training conservative agents in the latent space of normalizing flows
D Akimov, V Kurenkov, A Nikulin, D Tarasov, S Kolesnikov
arXiv preprint arXiv:2211.11096, 2022
Cited by 11, 2022
Prompts and Pre-Trained Language Models for Offline Reinforcement Learning
D Tarasov, V Kurenkov, S Kolesnikov
ICLR 2022 Workshop on Generalizable Policy Learning in Physical World, 2022
Cited by 4, 2022
Predicting perceived ethnicity with data on personal names in Russia
A Bessudnov, D Tarasov, V Panasovets, V Kostenko, I Smirnov, ...
Journal of Computational Social Science 6 (2), 589-608, 2023
Cited by 3, 2023
Katakomba: tools and benchmarks for data-driven NetHack
V Kurenkov, A Nikulin, D Tarasov, S Kolesnikov
Advances in Neural Information Processing Systems 36, 2024
Cited by 2, 2024
Distilling LLMs' Decomposition Abilities into Compact Language Models
D Tarasov, K Shridhar
arXiv preprint arXiv:2402.01812, 2024
Cited by 2, 2024
Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning?
D Tarasov, K Brilliantov, D Kharlapenko
arXiv preprint arXiv:2406.06309, 2024
Cited by 1, 2024
The Role of Deep Learning Regularizations on Actors in Offline RL
D Tarasov, A Surina, C Gulcehre
arXiv preprint arXiv:2409.07606, 2024
2024
Offline RL for generative design of protein binders
D Tarasov, UA Mbou Sob, M Arbesu, N Siboni, S Boyer, M Skwark, A Smit, ...
bioRxiv, 2023.11.29.569328, 2023
2023
Fixing 1-bit Adam and 1-bit LAMB algorithms
D Tarasov, VA Ershov
Computing 15 (4), 86-97, 2022
2022
Predicting ethnicity with data on personal names in Russia
A Bessudnov, D Tarasov, V Panasovets, V Kostenko, I Smirnov, ...
2021
Revisiting Behavior Regularized Actor-Critic
D Tarasov, V Kurenkov, A Nikulin, S Kolesnikov
Workshop on Reincarnating Reinforcement Learning at ICLR 2023, 0