Michael Littman

Dikutip oleh

	Semua	Sejak 2019
Kutipan	62042	25164
indeks-h	99	65
indeks-i10	251	177

4900

2450

1225

3675

19961997199819992000200120022003200420052006200720082009201020112012201320142015201620172018201920202021202220232024221 347 475 494 591 648 827 1074 1273 1504 1510 1852 1846 1920 2070 2113 2234 2344 2204 2246 2304 2463 2987 3380 4022 4234 4379 4875 4256

Akses publik

Lihat semua

49 artikel

0 artikel

tersedia

tidak tersedia

Berdasarkan pada mandat pendanaan

Ikuti

Michael Littman

Brown University

Email yang diverifikasi di brown.edu - Beranda

reinforcement learning machine learning artificial intelligence


Judul Urutkan menurut kutipan Urutkan menurut tahun Urutkan menurut judul	Dikutip oleh Dikutip oleh	Tahun
Reinforcement learning: A survey LP Kaelbling, ML Littman, AW Moore Journal of artificial intelligence research 4, 237-285, 1996	12229	1996
Planning and acting in partially observable stochastic domains LP Kaelbling, ML Littman, AR Cassandra Artificial intelligence 101 (1-2), 99-134, 1998	5910	1998
Markov games as a framework for multi-agent reinforcement learning ML Littman Machine learning proceedings 1994, 157-163, 1994	4172	1994
Measuring praise and criticism: Inference of semantic orientation from association PD Turney, ML Littman acm Transactions on Information Systems (tois) 21 (4), 315-346, 2003	2426	2003
Activity recognition from accelerometer data N Ravi, N Dandekar, P Mysore, ML Littman Aaai 5 (2005), 1541-1546, 2005	2322	2005
Packet routing in dynamically changing networks: A reinforcement learning approach J Boyan, M Littman Advances in neural information processing systems 6, 1993	1220	1993
Convergence results for single-step on-policy reinforcement-learning algorithms S Singh, T Jaakkola, ML Littman, C Szepesvári Machine learning 38, 287-308, 2000	1043	2000
Learning policies for partially observable environments: Scaling up ML Littman, AR Cassandra, LP Kaelbling Machine Learning Proceedings 1995, 362-370, 1995	1043	1995
Acting optimally in partially observable stochastic domains AR Cassandra, LP Kaelbling, ML Littman Aaai 94, 1023-1028, 1994	1033	1994
Friend-or-foe Q-learning in general-sum games ML Littman ICML 1 (2001), 322-328, 2001	938	2001
Graphical models for game theory M Kearns, ML Littman, S Singh arXiv preprint arXiv:1301.2281, 2013	821	2013
On the complexity of solving Markov decision problems ML Littman, TL Dean, LP Kaelbling arXiv preprint arXiv:1302.4971, 2013	753	2013
Interactions between learning and evolution D Ackley, M Littman Artificial life II 10, 487-509, 1991	743	1991
Predictive representations of state M Littman, RS Sutton Advances in neural information processing systems 14, 2001	733	2001
Incremental pruning: A simple, fast, exact method for partially observable Markov decision processes AR Cassandra, ML Littman, NL Zhang arXiv preprint arXiv:1302.1525, 2013	700	2013
Computerized cross-language document retrieval using latent semantic indexing TK Landauer, ML Littman US Patent 5,301,109, 1994	665	1994
An analysis of model-based interval estimation for Markov decision processes AL Strehl, ML Littman Journal of Computer and System Sciences 74 (8), 1309-1331, 2008	657	2008
PAC model-free reinforcement learning AL Strehl, L Li, E Wiewiora, J Langford, ML Littman Proceedings of the 23rd international conference on Machine learning, 881-888, 2006	654	2006
Towards a unified theory of state abstraction for MDPs. L Li, TJ Walsh, ML Littman AI&M 1 (2), 3, 2006	636	2006
Algorithms for sequential decision-making ML Littman Brown University, 1996	607	1996

Sistem tidak dapat melakukan operasi ini. Coba lagi nanti.

Artikel 1–20

Kutipan per tahun

Kutipan duplikat

Kutipan yang digabung

Tambahkan pengarang bersamaPengarang bersama

Ikuti

Dikutip oleh