Gerald Tesauro

Gerald Tesauro
Gerald Tesauro
Nationality	American
Alma mater	University of Maryland, College Park (B.S. Physics); Princeton University (Ph.D. Physics, 1986)
Known for	TD-Gammon, IBM Watson
Awards	Hertz Foundation Fellow (1980); Fellow of the AAAI (2013); Fellow of the ACM (2018)
	Scientific career
Fields	Artificial neural network, Reinforcement learning, Autonomic computing
Institutions	IBM Research, University of Illinois Urbana-Champaign (postdoc)
Thesis	Steady-State Dynamics and Selection Principles in Nonequilibrium Pattern-Forming Systems (1986)
Doctoral advisor	Philip W. Anderson, Michael C. Cross

Gerald J. "Gerry" Tesauro is an American computer scientist and a researcher at IBM, known for his development of TD-Gammon, a backgammon program that taught itself to play at a world-championship level through self-play and temporal difference learning, an early success in reinforcement learning and neural networks. He subsequently researched on autonomic computing, multi-agent systems for e-commerce, and contributed to the game strategy algorithms for IBM Watson.