Richard S. Sutton - Webdocs Cs Ualberta - University of Alberta


Research

Pioneered and made repeated contributions to reinforcement learning, an approach to artificial and natural intelligence that emphasizes learning and planning from sample experience. Currently seeking to extend reinforcement learning ideas to an empirically grounded approach to knowledge representation.

Most significant contributions:
• The theory of temporal-difference learning and the TD(λ) algorithm (1988)
• The standard textbook for reinforcement learning (with Barto, 1998)
• The actor-critic (policy gradient) class of algorithms (1984, 2000)
• The Dyna architecture integrating learning, planning and reacting (1990)
• Temporal-difference models of animal learning (with Barto, 1981, 1990)
• The “options” framework for temporal abstraction (with Precup, Singh, 1999)
• Predictive state representations (with Littman, Singh, 2002)
• Algorithms for online step-size adaptation (1981, 1992)
• Temporal-difference networks for grounded knowledge representation (2005, 2006)
• Gradient temporal-difference algorithms (with Maei, Szepesvari, 2008–)

Selected Grants

NSERC Collaborative Research and Development Grant, with Nortel Networks and Bell Canada, “Learning and Prediction in High-dimensional Stochastic Domains,” September 2006 – August 2009, funded at CAD$186,523. One of five principal investigators.

iCORE Chair and Professorship Establishment Grant, “Reinforcement Learning and Artificial Intelligence,” September 1, 2003 – August 31, 2008, funded at CAD$3,000,000. Principal investigator. Renewed until August 2013 at an additional CAD$2,750,000.

Alberta Ingenuity Centre Grant, “Alberta Ingenuity Centre for Machine Learning,” April 2003 – March 2008, funded at CAD$9,887,600. One of eight principal investigators. Renewed until March 2009 at CAD$2,000,000. Renewed again in 2009 for another five years at CAD$10,000,000.

NSERC Discovery Grant, “Reinforcement Learning and Artificial Intelligence,” April, 2004 – March 2009, funded at CAD$250,000. Principal investigator. Renewed in 2009 for a second five years at an additional CAD$190,000.

Air Force Office of Scientific Research to the University of Massachusetts, “Stochastic Scheduling and Planning Using Reinforcement Learning,” AFOSR Grant Number F49620-96-1-0254, June 1, 1996 – May 31, 2000, funded at USD$446,570. Coprincipal investigator with A. Barto.

National Science Foundation to the University of Massachusetts, “Multiple Time Scale Reinforcement Learning,” Communications & Computational Systems/Neuroengineering Grant ECS-9511805, September 15, 1995 – August 31, 1998, funded at USD$157,261. Primary senior personnel (A. Barto, PI).

Teaching Experience

Courses
• Intelligent Systems, CMPUT 366, an introduction to artificial intelligence for undergraduates at the University of Alberta, 2008–2010
• Reinforcement Learning for Artificial Intelligence, CMPUT 499/609, cross-listed undergraduate/graduate course at the University of Alberta, 2003–2007, 2009–10
• Non-procedural Programming Languages, CMPUT 325, undergraduate course at the University of Alberta, Fall 2006
• Reinforcement Learning in Practice, CMPUT 607, graduate course at the University of Alberta, Spring 2005
• Reinforcement Learning, CMPSCI 791O, graduate course at the University of Massachusetts, with A. Barto, Fall 1996 and 1997
• Reinforcement Learning, special intensive course at the University of Uppsala, Sweden, April–May 1996
• Reinforcement Learning, CMPSCI 791N, graduate seminar at the University of Massachusetts, with A. Barto, Fall 1995
• Cybernetics of Adaptation and Learning, COINS 791M, graduate seminar, teaching assistant including some lecturing, Fall 1981

Summer Schools
• Instructor at Machine Learning Summer School, Ile de Rey, France, 2008
• Instructor at Cambridge University Neural Networks Summer School (three lectures each summer), 1993–1997
• Instructor at Cold Spring Harbor Summer School on Computational Neuroscience: Learning and Memory, July 1990
Page 1: CURRICULUM VITAE Richard S. Sutton
