Gerald Tesauro

Gerald Tesauro
NationalityAmerican
Alma materUniversity of Maryland, College Park (B.S. Physics)
Princeton University (Ph.D. Physics, 1986)
Known forTD-Gammon, IBM Watson
AwardsHertz Foundation Fellow (1980)
Fellow of the AAAI (2013)
Fellow of the ACM (2018)
Scientific career
FieldsArtificial neural network, Reinforcement learning, Autonomic computing
InstitutionsIBM Research, University of Illinois Urbana-Champaign (postdoc)
ThesisSteady-State Dynamics and Selection Principles in Nonequilibrium Pattern-Forming Systems (1986)
Doctoral advisorPhilip W. Anderson, Michael C. Cross

Gerald J. "Gerry" Tesauro is an American computer scientist and a researcher at IBM, known for his development of TD-Gammon, a backgammon program that taught itself to play at a world-championship level through self-play and temporal difference learning, an early success in reinforcement learning and neural networks. He subsequently researched on autonomic computing, multi-agent systems for e-commerce, and contributed to the game strategy algorithms for IBM Watson.