Learning from Delayed Rewards
Type of publication: | Phdthesis |
Citation: | watkins_phd89 |
Year: | 1989 |
School: | Cambridge University |
Abstract: | Descriptions of Watkins' Q($\lambda$) |
Userfields: | date-added={2012-09-03 15:47:30 +0200}, date-modified={2012-09-03 15:47:30 +0200}, project={fremdliteratur}, |
Keywords: | Learning |
Authors: | Watkins, Christopher J. C. H. |