A Policy Iteration Algorithm for Learning from Preference-based Feedback
| Type of publication: | Inproceedings |
| Citation: | cwIDA13 |
| Booktitle: | Advances in Intelligent Data Analysis XII: 12th International Symposium (IDA-13) |
| Series: | LNCS |
| Volume: | 8207 |
| Year: | 2013 |
| Month: | October |
| Pages: | 427--437 |
| Publisher: | Springer-Verlag |
| DOI: | 10.1007/978-3-642-41398-8_37 |
| Keywords: | |
| Authors | |
| Editors | |
|
Topics
|
|
|
|
