Satinder Singh

Citado por

	Total	Desde 2019
Citas	45274	24451
Índice h	82	63
Índice i10	212	152

4800

2400

1200

3600

1994199519961997199819992000200120022003200420052006200720082009201020112012201320142015201620172018201920202021202220232024169 145 209 236 327 309 321 397 513 549 679 809 907 1082 1011 996 998 1006 980 1044 1052 1050 1352 1552 2487 3085 3838 4054 4480 4721 4256

Acceso público

Ver todo

40 artículos

1 artículo

disponibles

no disponibles

Basado en requisitos de financiación

Coautores

Richard L. LewisProfessor of Psychology, Linguistics and Cognitive Science, University of MichiganDirección de correo verificada de umich.edu
Richard S. SuttonKeen, Amii, and University of AlbertaDirección de correo verificada de richsutton.com
Michael KearnsProfessor of Computer Science, University of PennsylvaniaDirección de correo verificada de cis.upenn.edu
Doina PrecupDeepMind and McGill UniversityDirección de correo verificada de cs.mcgill.ca
Andrew BartoUniversity of Massachusetts AmherstDirección de correo verificada de cs.umass.edu
Junhyuk OhResearch Scientist, DeepMindDirección de correo verificada de google.com
Yishay MansourTel Aviv UniversityDirección de correo verificada de tauex.tau.ac.il
Michael LittmanBrown UniversityDirección de correo verificada de brown.edu
David McAllesterProfessor, Toyota Technological Institute at ChicagoDirección de correo verificada de ttic.edu
Honglak LeeLG AI Research / U. MichiganDirección de correo verificada de umich.edu
David SilverDeepMind, UCLDirección de correo verificada de google.com
Tommi JaakkolaMITDirección de correo verificada de csail.mit.edu
Michael WellmanProfessor of Computer Science & Engineering, University of MichiganDirección de correo verificada de umich.edu
Hado van HasseltResearch Scientist, DeepMind; Honorary Professor, UCLDirección de correo verificada de google.com
Tom ZahavyStaff Research Scientist, Google DeepMindDirección de correo verificada de deepmind.com
Yevgeniy VorobeychikWashington University in Saint LouisDirección de correo verificada de wustl.edu
Edmund DurfeeProfessor Emeritus of Computer Science and Engineering, University of MichiganDirección de correo verificada de umich.edu
Nan JiangAssociate Professor of Computer Science, UIUCDirección de correo verificada de illinois.edu
Xiaoxiao GuoLinkedInDirección de correo verificada de fb.com
Marilyn WalkerProfessor of Computer Science and Engineering, University of California Santa CruzDirección de correo verificada de ucsc.edu

Seguir

Satinder Singh

Google DeepMind / U. of Michigan

Dirección de correo verificada de umich.edu - Página principal

Reinforcement Learning Computational Game Theory Artificial Intelligence


Título Ordenar por citas Ordenar por año Ordenar por título	Citado por Citado por	Año
Policy Gradient Methods for Reinforcement Learning with Function Approximation R Sutton, D McAllester, S Singh, Y Mansour Neural Information Processing Systems, 1999	8920	1999
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning RS Sutton, D Precup, S Singh Artificial intelligence 112 (1-2), 181-211, 1999	4618	1999
Learning to act using real-time dynamic programming AG Barto, SJ Bradtke, SP Singh Artificial intelligence 72 (1-2), 81-138, 1995	1680	1995
Near-optimal reinforcement learning in polynomial time M Kearns, S Singh Machine learning 49, 209-232, 2002	1389	2002
Convergence of stochastic iterative dynamic programming algorithms T Jaakkola, M Jordan, S Singh Advances in neural information processing systems 6, 1993	1372	1993
Action-conditional video prediction using deep networks in atari games J Oh, X Guo, H Lee, RL Lewis, S Singh Advances in neural information processing systems 28, 2015	1103	2015
Intrinsically motivated reinforcement learning N Chentanez, A Barto, S Singh Advances in neural information processing systems 17, 2004	1056	2004
Reinforcement learning with replacing eligibility traces SP Singh, RS Sutton Machine learning 22 (1), 123-158, 1996	1054	1996
Convergence results for single-step on-policy reinforcement-learning algorithms S Singh, T Jaakkola, ML Littman, C Szepesvári Machine learning 38, 287-308, 2000	1043	2000
Eligibility traces for off-policy policy evaluation D Precup, R Sutton, S Singh Computer Science Department Faculty Publication Series, 80, 2000	1001	2000
Graphical models for game theory M Kearns, ML Littman, S Singh arXiv preprint arXiv:1301.2281, 2013	821	2013
Predictive representations of state ML Littman, RS Sutton, S Singh Advances in neural information processing systems, 1555-1561, 2002	733	2002
Learning without state-estimation in partially observable Markovian decision processes SP Singh, T Jaakkola, MI Jordan Machine Learning Proceedings 1994, 284-292, 1994	630	1994
Reward is enough D Silver, S Singh, D Precup, RS Sutton Artificial Intelligence 299, 103535, 2021	624	2021
Intrinsically motivated reinforcement learning: An evolutionary perspective S Singh, RL Lewis, AG Barto, J Sorg IEEE Transactions on Autonomous Mental Development 2 (2), 70-82, 2010	590	2010
Intrinsically motivated learning of hierarchical collections of skills AG Barto, S Singh, N Chentanez Proceedings of the 3rd International Conference on Development and Learning …, 2004	580	2004
Optimizing dialogue management with reinforcement learning: Experiments with the NJFun system S Singh, D Litman, M Kearns, M Walker Journal of Artificial Intelligence Research 16, 105-133, 2002	522	2002
Transfer of learning by composing solutions of elemental sequential tasks SP Singh Machine learning 8, 323-339, 1992	508	1992
Reinforcement Learning with Soft State Aggregation S Singh, T Jaakkola, M Jordan Neural Information Processing Systems, 1995	477	1995
Deep learning for real-time Atari game play using offline Monte-Carlo tree search planning X Guo, S Singh, H Lee, RL Lewis, X Wang Advances in neural information processing systems 27, 2014	466	2014

El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.

Artículos 1–20

Citas por año

Citas duplicadas

Citas combinadas

Añadir coautoresCoautores

Seguir

Citado por

Coautores