Doina Precup

Cited by

	All	Since 2019
Citations	36906	27754
h-index	68	59
i10-index	245	199

Citations per year

Year	Citations
2002	117
2003	117
2004	178
2005	219
2006	246
2007	308
2008	331
2009	318
2010	322
2011	378
2012	407
2013	484
2014	592
2015	607
2016	882
2017	1098
2018	1899
2019	2610
2020	3438
2021	4338
2022	5293
2023	6037
2024	5982

Public access (based on funding mandates)

Available	60 articles
Not available	8 articles

Co-authors

Joelle Pineau, School of Computer Science, McGill University; FAIR, Meta AI; Mila (verified email at cs.mcgill.ca)
Satinder Singh, Google DeepMind / U. of Michigan (verified email at umich.edu)
Prakash Panangaden, Professor of Computer Science, McGill University (verified email at cs.mcgill.ca)
Tal Arbel, Professor of Electrical & Computer Engineering, McGill University (verified email at cim.mcgill.ca)
Riashat Islam, Research Scientist (verified email at mail.mcgill.ca)
Andre Barreto, Research Scientist, Google DeepMind (verified email at google.com)
Yoshua Bengio, Professor of computer science, University of Montreal, Mila, IVADO, CIFAR (verified email at umontreal.ca)
Emmanuel Bengio, McGill University; Recursion/Valence Labs (verified email at mail.mcgill.ca)
David Silver, DeepMind, UCL (verified email at google.com)
Shie Mannor, Professor of Electrical Engineering @ Technion & Researcher @ Nvidia (verified email at technion.ac.il)
Jean Harb, OpenAI (verified email at openai.com)
Guilherme Sant'Anna, Professor (Full) of Pediatrics, McGill University (verified email at mcgill.ca)
Philip Warrick, Perigen Canada; McGill University (verified email at perigen.com)
Csaba Szepesvari, DeepMind & University of Alberta (verified email at cs.ualberta.ca)
Norm Ferns (verified email at normferns.com)
Jordan Frank, Software Engineer, Facebook (verified email at cs.mcgill.ca)
Pablo Samuel Castro, Google (verified email at google.com)
Amir-massoud Farahmand, Polytechnique Montreal, Mila, University of Toronto (verified email at cs.toronto.edu)
Hamid Maei, Netflix (verified email at netflix.com)
Hado van Hasselt, Research Scientist, DeepMind; Honorary Professor, UCL (verified email at google.com)

Doina Precup

DeepMind and McGill University

Verified email at cs.mcgill.ca

Research interests: artificial intelligence, machine learning, reinforcement learning


Title	Cited by	Year
The multimodal brain tumor image segmentation benchmark (BRATS). BH Menze, A Jakab, S Bauer, J Kalpathy-Cramer, K Farahani, J Kirby, ... IEEE transactions on medical imaging 34 (10), 1993-2024, 2014	6210	2014
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. RS Sutton, D Precup, S Singh. Artificial intelligence 112 (1-2), 181-211, 1999	4620	1999
Deep reinforcement learning that matters. P Henderson, R Islam, P Bachman, J Pineau, D Precup, D Meger. Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	2519	2018
Off-policy deep reinforcement learning without exploration. S Fujimoto, D Meger, D Precup. International conference on machine learning, 2052-2062, 2019	1713	2019
The option-critic architecture. PL Bacon, J Harb, D Precup. Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017	1313	2017
Eligibility traces for off-policy policy evaluation. D Precup. Computer Science Department Faculty Publication Series, 80, 2000	999	2000
Fast gradient-descent methods for temporal-difference learning with linear function approximation. RS Sutton, HR Maei, D Precup, S Bhatnagar, D Silver, C Szepesvári, ... Proceedings of the 26th annual international conference on machine learning …, 2009	737	2009
Learning with pseudo-ensembles. P Bachman, O Alsharif, D Precup. Advances in neural information processing systems 27, 2014	708	2014
Reward is enough. D Silver, S Singh, D Precup, RS Sutton. Artificial Intelligence 299, 103535, 2021	624	2021
Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction. RS Sutton, J Modayil, M Delp, T Degris, PM Pilarski, A White, D Precup. The 10th International Conference on Autonomous Agents and Multiagent …, 2011	624	2011
Algorithms for multi-armed bandit problems. V Kuleshov, D Precup. arXiv preprint arXiv:1402.6028, 2014	595	2014
Exploring uncertainty measures in deep networks for multiple sclerosis lesion detection and segmentation. T Nair, D Precup, DL Arnold, T Arbel. Medical image analysis 59, 101557, 2020	494	2020
Learning options in reinforcement learning. M Stolle, D Precup. Abstraction, Reformulation, and Approximation: 5th International Symposium …, 2002	484	2002
Off-policy temporal-difference learning with function approximation. D Precup, RS Sutton, S Dasgupta. ICML, 417-424, 2001	473	2001
Temporal abstraction in reinforcement learning. D Precup. University of Massachusetts Amherst, 2000	410	2000
Metrics for Finite Markov Decision Processes. N Ferns, P Panangaden, D Precup. UAI 4, 162-169, 2004	373	2004
Conditional computation in neural networks for faster models. E Bengio, PL Bacon, J Pineau, D Precup. arXiv preprint arXiv:1511.06297, 2015	360	2015
Convergent temporal-difference learning with arbitrary smooth function approximation. H Maei, C Szepesvari, S Bhatnagar, D Precup, D Silver, RS Sutton. Advances in neural information processing systems 22, 2009	357	2009
Reproducibility of benchmarked deep reinforcement learning tasks for continuous control. R Islam, P Henderson, M Gomrokchi, D Precup. arXiv preprint arXiv:1708.04133, 2017	335	2017
Deep learning, reinforcement learning, and world models. Y Matsuo, Y LeCun, M Sahani, D Precup, D Silver, M Sugiyama, E Uchibe, ... Neural Networks 152, 267-275, 2022	327	2022
