Denis Tarasov

Citado por

	Todos	Desde 2019
Citações	162	162
Índice h	5	5
Índice i10	5	5

120

2023202439 119

Coautores

Vladislav Kurenkovdunnolab, AIRIE-mail confirmado em innopolis.ru
Alexander Nikulindunnolab.aiE-mail confirmado em edu.hse.ru
Sergey KolesnikovE-mail confirmado em phystech.edu
Dmitry AkimovIndependent ResearcherE-mail confirmado em edu.hse.ru
Alexey BessudnovUniversity of ExeterE-mail confirmado em exeter.ac.uk
Veronica KostenkoTel Aviv UniversityE-mail confirmado em tauex.tau.ac.il
Ivan SmirnovUniversity of Technology SydneyE-mail confirmado em uts.edu.au
Kumar ShridharETH ZurichE-mail confirmado em inf.ethz.ch
Kirill BrilliantovStudent, ETH ZurichE-mail confirmado em student.ethz.ch
Marcin J. SkwarkBioinformatics Lead, InstaDeepE-mail confirmado em skwark.pl
Arnu PretoriusStaff Research Scientist, InstaDeep LtdE-mail confirmado em instadeep.com
Miguel Arbesú AndrésInstaDeepE-mail confirmado em instadeep.com
Oliver BentInstaDeepE-mail confirmado em instadeep.com
Dries SmitInstaDeep, Stellenbosch UniversityE-mail confirmado em sun.ac.za
Ulrich A. Mbou SobResearch ScientistE-mail confirmado em instadeep.com
Nima H. siboniResearch Engineer in InstaDeepE-mail confirmado em instadeep.com
Caglar GulcehreAI Researcher, Prof at EPFL, Consultant@Google DeepMind, ex-Staff Research Scientist@Google DeepMindE-mail confirmado em google.com

Seguir

Denis Tarasov

ETH Zurich

E-mail confirmado em ethz.ch - Página inicial

Machine Learning Deep Learning Reinforcement Learning NLP


Título Ordenar por citações Ordenar por ano Ordenar por título	Citado por Citado por	Ano
CORL: Research-oriented deep offline reinforcement learning library D Tarasov, A Nikulin, D Akimov, V Kurenkov, S Kolesnikov Advances in Neural Information Processing Systems 36, 2024	76	2024
Anti-exploration by random network distillation A Nikulin, V Kurenkov, D Tarasov, S Kolesnikov International Conference on Machine Learning, 26228-26244, 2023	24	2023
Revisiting the minimalist approach to offline reinforcement learning D Tarasov, V Kurenkov, A Nikulin, S Kolesnikov Advances in Neural Information Processing Systems 36, 2024	22	2024
Q-ensemble for offline rl: Don't scale the ensemble, scale the batch size A Nikulin, V Kurenkov, D Tarasov, D Akimov, S Kolesnikov arXiv preprint arXiv:2211.11092, 2022	17	2022
Let offline rl flow: Training conservative agents in the latent space of normalizing flows D Akimov, V Kurenkov, A Nikulin, D Tarasov, S Kolesnikov arXiv preprint arXiv:2211.11096, 2022	11	2022
Prompts and Pre-Trained Language Models for Offline Reinforcement Learning D Tarasov, V Kurenkov, S Kolesnikov ICLR 2022 Workshop on Generalizable Policy Learning in Physical World, 2022	4	2022
Predicting perceived ethnicity with data on personal names in Russia A Bessudnov, D Tarasov, V Panasovets, V Kostenko, I Smirnov, ... Journal of Computational Social Science 6 (2), 589-608, 2023	3	2023
Katakomba: tools and benchmarks for data-driven NetHack V Kurenkov, A Nikulin, D Tarasov, S Kolesnikov Advances in Neural Information Processing Systems 36, 2024	2	2024
Distilling LLMs' Decomposition Abilities into Compact Language Models D Tarasov, K Shridhar arXiv preprint arXiv:2402.01812, 2024	2	2024
Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning? D Tarasov, K Brilliantov, D Kharlapenko arXiv preprint arXiv:2406.06309, 2024	1	2024
The Role of Deep Learning Regularizations on Actors in Offline RL D Tarasov, A Surina, C Gulcehre arXiv preprint arXiv:2409.07606, 2024		2024
Offline RL for generative design of protein binders D Tarasov, UA Mbou Sob, M Arbesu, N Siboni, S Boyer, M Skwark, A Smit, ... bioRxiv, 2023.11. 29.569328, 2023		2023
Fixing 1-bit Adam and 1-bit LAMB algorithms D Tarasov, VA Ershov Computing 15 (4), 86-97, 2022		2022
Predicting ethnicity with data on personal names in Russia A Bessudnov, D Tarasov, V Panasovets, V Kostenko, I Smirnov, ...		2021
Revisiting Behavior Regularized Actor-Critic D Tarasov, V Kurenkov, A Nikulin, S Kolesnikov Workshop on Reincarnating Reinforcement Learning at ICLR 2023, 0

O sistema não pode executar a operação agora. Tente novamente mais tarde.

Artigos 1–15

Citações por ano

Citações duplicadas

Citações mescladas

Adicionar coautoresCoautores

Seguir

Citado por

Coautores