A Policy Iteration Algorithm for Learning from Preference-based Feedback
Type of publication: | Inproceedings |
Citation: | cwIDA13 |
Booktitle: | Advances in Intelligent Data Analysis XII: 12th International Symposium (IDA-13) |
Series: | LNCS |
Volume: | 8207 |
Year: | 2013 |
Month: | October |
Pages: | 427--437 |
Publisher: | Springer-Verlag |
DOI: | 10.1007/978-3-642-41398-8_37 |
Keywords: | |
Authors | |
Editors | |
Topics
|
|
|