Michael Littman
Title
Cited by
Year
Reinforcement learning: A survey
LP Kaelbling, ML Littman, AW Moore
Journal of artificial intelligence research 4, 237-285, 1996
7756 1996
Planning and acting in partially observable stochastic domains
LP Kaelbling, ML Littman, AR Cassandra
Artificial intelligence 101 (1-2), 99-134, 1998
3950 1998
Markov games as a framework for multi-agent reinforcement learning
ML Littman
Machine learning proceedings 1994, 157-163, 1994
2196 1994
Measuring praise and criticism: Inference of semantic orientation from association
PD Turney, ML Littman
ACM Transactions on Information Systems (TOIS) 21 (4), 315-346, 2003
1989 2003
Activity recognition from accelerometer data
N Ravi, N Dandekar, P Mysore, ML Littman
AAAI 5 (2005), 1541-1546, 2005
1833 2005
Packet routing in dynamically changing networks: A reinforcement learning approach
JA Boyan, ML Littman
Advances in neural information processing systems, 671-678, 1994
844 1994
Acting optimally in partially observable stochastic domains
AR Cassandra, LP Kaelbling, ML Littman
AAAI 94, 1023-1028, 1994
800 1994
Learning policies for partially observable environments: Scaling up
ML Littman, AR Cassandra, LP Kaelbling
Machine Learning Proceedings 1995, 362-370, 1995
785 1995
Convergence results for single-step on-policy reinforcement-learning algorithms
S Singh, T Jaakkola, ML Littman, C Szepesvári
Machine learning 38 (3), 287-308, 2000
701 2000
Graphical models for game theory
M Kearns, ML Littman, S Singh
arXiv preprint arXiv:1301.2281, 2013
694 2013
Interactions between learning and evolution
D Ackley, M Littman
Artificial life II 10, 487-509, 1991
668 1991
On the complexity of solving Markov decision problems
ML Littman, TL Dean, LP Kaelbling
arXiv preprint arXiv:1302.4971, 2013
587 2013
Incremental pruning: A simple, fast, exact method for partially observable Markov decision processes
AR Cassandra, ML Littman, NL Zhang
arXiv preprint arXiv:1302.1525, 2013
574 2013
Friend-or-foe Q-learning in general-sum games
ML Littman
ICML 1, 322-328, 2001
538 2001
Predictive representations of state
ML Littman, RS Sutton
Advances in neural information processing systems, 1555-1561, 2002
524 2002
Computerized cross-language document retrieval using latent semantic indexing
TK Landauer, ML Littman
US Patent 5,301,109, 1994
490 1994
Algorithms for sequential decision making
ML Littman
Brown University, 1996
488 1996
Unsupervised learning of semantic orientation from a hundred-billion-word corpus
PD Turney, ML Littman
arXiv preprint cs/0212012, 2002
399 2002
Value-function reinforcement learning in Markov games
ML Littman
Cognitive systems research 2 (1), 55-66, 2001
388 2001
PAC model-free reinforcement learning
AL Strehl, L Li, E Wiewiora, J Langford, ML Littman
Proceedings of the 23rd international conference on Machine learning, 881-888, 2006
386 2006