Ilya Kostrikov

Cited by

	All	Since 2020
Citations	7280	6604
h-index	29	28
i10-index	32	31

Citations per year

Year	Citations
2016	43
2017	116
2018	191
2019	252
2020	363
2021	700
2022	1038
2023	1680
2024	2149
2025	671

Public access

11 articles available
0 articles not available

Based on funding mandates

Co-authors

Sergey Levine	UC Berkeley, Physical Intelligence	Verified email at eecs.berkeley.edu
Rob Fergus	Research Scientist, DeepMind. Professor of Computer Science, New York University	Verified email at cs.nyu.edu
Denis Yarats	Cofounder and CTO, Perplexity AI	Verified email at perplexity.ai
Ofir Nachum	OpenAI	Verified email at openai.com
Jonathan Tompson	Google	Verified email at google.com
Laura Smith	UC Berkeley	Verified email at berkeley.edu
Ashvin Nair	OpenAI	Verified email at berkeley.edu
Michael Janner	OpenAI	Verified email at openai.com
Juergen Gall	University of Bonn	Verified email at iai.uni-bonn.de
Tobias Weyand	Google Research	Verified email at google.com
Benjamin Eysenbach	Princeton University	Verified email at princeton.edu
Bastian Leibe	Professor for Computer Vision, RWTH Aachen University	Verified email at vision.rwth-aachen.de
Amy Zhang	Assistant Professor of Electrical and Computer Engineering at University of Texas at Austin	Verified email at austin.utexas.edu
Sainbayar Sukhbaatar	FAIR team, Meta AI	Verified email at fb.com
Debidatta Dwibedi	Google DeepMind	Verified email at google.com
Joan Bruna	Professor of Computer Science, Data Science & Mathematics (aff), Courant Institute and CDS, NYU	Verified email at cims.nyu.edu
Denis Zorin	Professor of Computer Science and Mathematics, Courant Institute, NYU	Verified email at cs.nyu.edu
Scott Emmons	UC Berkeley	Verified email at berkeley.edu
Vitaly Kurin	Research Scientist at Isomorphic Labs	Verified email at isomorphiclabs.com
Roberta Raileanu	Research Scientist at Meta, Honorary Lecturer at UCL

Ilya Kostrikov

OpenAI

Verified email at openai.com


Title	Cited by	Year
Offline reinforcement learning with implicit Q-learning. I Kostrikov, A Nair, S Levine. arXiv preprint arXiv:2110.06169, 2021	956	2021
Image augmentation is all you need: Regularizing deep reinforcement learning from pixels. I Kostrikov, D Yarats, R Fergus. arXiv preprint arXiv:2004.13649, 2020	911*	2020
PlaNet - photo geolocation with convolutional neural networks. T Weyand, I Kostrikov, J Philbin. Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016	547	2016
Improving sample efficiency in model-free reinforcement learning from images. D Yarats, A Zhang, I Kostrikov, B Amos, J Pineau, R Fergus. Proceedings of the AAAI Conference on Artificial Intelligence 35 (12), 10674 …, 2021	491	2021
Intrinsic motivation and automatic curricula via asymmetric self-play. S Sukhbaatar, Z Lin, I Kostrikov, G Synnaeve, A Szlam, R Fergus. arXiv preprint arXiv:1703.05407, 2017	450	2017
Discriminator-actor-critic: Addressing sample inefficiency and reward bias in adversarial imitation learning. I Kostrikov, KK Agrawal, D Dwibedi, S Levine, J Tompson. arXiv preprint arXiv:1809.02925, 2018	359	2018
Offline Reinforcement Learning with Fisher Divergence Critic Regularization. I Kostrikov, J Tompson, R Fergus, O Nachum. arXiv preprint arXiv:2103.08050, 2021	350	2021
GPT-4o system card. A Hurst, A Lerer, AP Goucher, A Perelman, A Ramesh, A Clark, AJ Ostrow, ... arXiv preprint arXiv:2410.21276, 2024	293	2024
Training diffusion models with reinforcement learning. K Black, M Janner, Y Du, I Kostrikov, S Levine. arXiv preprint arXiv:2305.13301, 2023	290	2023
AlgaeDICE: Policy gradient from arbitrary experience. O Nachum, B Dai, I Kostrikov, Y Chow, L Li, D Schuurmans. arXiv preprint arXiv:1912.02074, 2019	273	2019
Automatic data augmentation for generalization in deep reinforcement learning. R Raileanu, M Goldstein, D Yarats, I Kostrikov, R Fergus. arXiv preprint arXiv:2006.12862, 2020	244*	2020
PyTorch implementations of reinforcement learning algorithms. I Kostrikov. GitHub repository: https://github.com/ikostrikov/pytorch-a2c-ppo-acktr-gail, 2018	244	2018
Imitation learning via off-policy distribution matching. I Kostrikov, O Nachum, J Tompson. arXiv preprint arXiv:1912.05032, 2019	230	2019
RvS: What is essential for offline RL via supervised learning? S Emmons, B Eysenbach, I Kostrikov, S Levine. arXiv preprint arXiv:2112.10751, 2021	218	2021
An efficient convolutional network for human pose estimation. U Rafi, B Leibe, J Gall, I Kostrikov. BMVC 1, 2, 2016	179	2016
Efficient Online Reinforcement Learning with Offline Data. PJ Ball, L Smith, I Kostrikov*, S Levine. arXiv preprint arXiv:2302.02948, 2023	174	2023
IDQL: Implicit Q-learning as an actor-critic method with diffusion policies. P Hansen-Estruch, I Kostrikov, M Janner, JG Kuba, S Levine. arXiv preprint arXiv:2304.10573, 2023	138	2023
A walk in the park: Learning to walk in 20 minutes with model-free reinforcement learning. L Smith, I Kostrikov, S Levine. arXiv preprint arXiv:2208.07860, 2022	133*	2022
Offline RL for natural language generation with implicit language Q learning. C Snell, I Kostrikov, Y Su, M Yang, S Levine. arXiv preprint arXiv:2206.11871, 2022	109	2022
OpenAI o1 system card. A Jaech, A Kalai, A Lerer, A Richardson, A El-Kishky, A Low, A Helyar, ... arXiv preprint arXiv:2412.16720, 2024	106	2024

Articles 1–20