Richard S. Sutton

Cited by

	All	Since 2019
Citations	152321	81342
h-index	90	66
i10-index	219	168

Citations per year

Year	Citations
1990	408
1991	595
1992	588
1993	636
1994	846
1995	777
1996	764
1997	915
1998	1049
1999	1103
2000	1177
2001	1358
2002	1627
2003	1809
2004	2112
2005	2498
2006	2583
2007	3047
2008	2987
2009	3102
2010	3229
2011	3404
2012	3541
2013	3829
2014	3798
2015	3591
2016	4167
2017	5168
2018	7641
2019	9916
2020	12263
2021	13847
2022	15183
2023	16077
2024	14036

Public access (based on funding mandates)

	Articles
Available	23
Not available	0

Co-authors

Andrew Barto, University of Massachusetts Amherst (verified email at cs.umass.edu)
Satinder Singh, Google DeepMind / U. of Michigan (verified email at umich.edu)
Doina Precup, DeepMind and McGill University (verified email at cs.mcgill.ca)
Patrick M. Pilarski, Professor, University of Alberta, Amii (Alberta Machine Intelligence Institute) (verified email at ualberta.ca)
Charles Anderson, Professor of Computer Science, Colorado State University; Founder, Pattern Exploration, LLC (verified email at colostate.edu)
A. Rupam Mahmood, University of Alberta, Amii (verified email at ualberta.ca)
Adam White, University of Alberta, Amii (Alberta Machine Intelligence Institute) (verified email at ualberta.ca)
Csaba Szepesvari, DeepMind & University of Alberta (verified email at cs.ualberta.ca)
Shalabh Bhatnagar, Professor in the Department of Computer Science and Automation, Indian Institute of Science (verified email at iisc.ac.in)
Thomas Degris, DeepMind (verified email at google.com)
Hamid Maei, Netflix (verified email at netflix.com)
David Silver, DeepMind, UCL (verified email at google.com)
Martha White, University of Alberta (verified email at ualberta.ca)
David McAllester, Professor, Toyota Technological Institute at Chicago (verified email at ttic.edu)
Joseph Modayil, Openmind Research Institute & Keen AGI (verified email at openmindresearch.org)
Yishay Mansour, Tel Aviv University (verified email at tauex.tau.ac.il)
Elliot A. Ludvig, Professor, Psychology, University of Warwick (verified email at warwick.ac.uk)
E James Kehoe, Professor of Psychology, University of New South Wales (verified email at unsw.edu.au)
Peter Stone, Professor of Computer Science, The University of Texas at Austin (verified email at cs.utexas.edu)
Mohammad Ghavamzadeh, Amazon (verified email at amazon.com)

Richard S. Sutton

Keen, Amii, and University of Alberta

Verified email at richsutton.com

artificial intelligence, reinforcement learning, machine learning, cognitive science, computer science


Title	Authors	Venue	Cited by	Year
Reinforcement learning: An Introduction, 2nd edition	RS Sutton, AG Barto	MIT press, 2018	77010*	2018
Policy gradient methods for reinforcement learning with function approximation	RS Sutton, D McAllester, S Singh, Y Mansour	Advances in neural information processing systems 12, 1999	8927	1999
Learning to predict by the methods of temporal differences	RS Sutton	Machine learning 3, 9-44, 1988	8253	1988
Reinforcement learning: An Introduction, 1st edition	RS Sutton, AG Barto	MIT press, 1998	6071*	1998
Neuronlike adaptive elements that can solve difficult learning control problems	AG Barto, RS Sutton, CW Anderson	IEEE transactions on systems, man, and cybernetics 13 (5), 834-846, 1983	5225	1983
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning	RS Sutton, D Precup, S Singh	Artificial intelligence 112 (1-2), 181-211, 1999	4620	1999
Integrated architectures for learning, planning, and reacting based on approximating dynamic programming	RS Sutton	Proceedings of the International Conference on Machine Learning, 216-224, 1990	2281	1990
Generalization in reinforcement learning: Successful examples using sparse coarse coding	RS Sutton	Advances in neural information processing systems 8, 1995	1903	1995
Toward a modern theory of adaptive networks: Expectation and prediction.	RS Sutton, AG Barto	Psychological review 88 (2), 135, 1981	1861	1981
Neural networks for control	WT Miller, PJ Werbos, RS Sutton	MIT press, 1990	1849	1990
Temporal credit assignment in reinforcement learning	RS Sutton	University of Massachusetts, Amherst, http://www.incompleteideas.net/papers …, 1984	1251	1984
Dyna, an integrated architecture for learning, planning, and reacting	RS Sutton	ACM Sigart Bulletin 2 (4), 160-163, 1991	1246	1991
Introduction to reinforcement learning. Vol. 135	RS Sutton, AG Barto	MIT press Cambridge 5, 21-22, 1998	1175	1998
Incremental natural actor-critic algorithms	S Bhatnagar, RS Sutton, M Ghavamzadeh, M Lee	Advances in neural information processing systems, 2008	1108	2008
Reinforcement learning with replacing eligibility traces	SP Singh, RS Sutton	Machine learning 22 (1), 123-158, 1996	1055	1996
Eligibility traces for off-policy policy evaluation	D Precup, RS Sutton, S Singh	International Conference on Machine Learning 16, 759-766, 2000	999	2000
Time-derivative models of Pavlovian reinforcement.	RS Sutton, AG Barto	Learning and Computational Neuroscience: Foundations of Adaptive Networks …, 1990	853	1990
Reinforcement learning is direct adaptive optimal control	RS Sutton, AG Barto, RJ Williams	IEEE control systems magazine 12 (2), 19-22, 1992	845	1992
A menu of designs for reinforcement learning over time	WT Miller, RS Sutton, PJ Werbos	MIT press, 1995	776	1995
S., Barto A., G.,“	R Sutton	Reinforcement Learning, An Introduction, 2000	742*	2000
