Gerald Tesauro
| Gerald Tesauro | |
|---|---|
| Nationality | American | 
| Alma mater | University of Maryland, College Park (B.S. Physics) Princeton University (Ph.D. Physics, 1986) | 
| Known for | TD-Gammon, IBM Watson | 
| Awards | Hertz Foundation Fellow (1980) Fellow of the AAAI (2013) Fellow of the ACM (2018) | 
| Scientific career | |
| Fields | Artificial neural network, Reinforcement learning, Autonomic computing | 
| Institutions | IBM Research, University of Illinois Urbana-Champaign (postdoc) | 
| Thesis | Steady-State Dynamics and Selection Principles in Nonequilibrium Pattern-Forming Systems (1986) | 
| Doctoral advisor | Philip W. Anderson, Michael C. Cross | 
Gerald J. "Gerry" Tesauro is an American computer scientist and a researcher at IBM, known for his development of TD-Gammon, a backgammon program that taught itself to play at a world-championship level through self-play and temporal difference learning, an early success in reinforcement learning and neural networks. He subsequently researched on autonomic computing, multi-agent systems for e-commerce, and contributed to the game strategy algorithms for IBM Watson.