Preference-based Reinforcement Learning: A Formal Framework and a Policy Iteration Algorithm (2012), in: Machine Learning, 89:1-2(123--156) | , , and ,
[DOI] |
Preference-Based Policy Iteration: Leveraging Preference Learning for Reinforcement Learning, in: Proceedings of the 22nd European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2011, Athens, Greece), Part I, pages 312--327, Springer, 2011 | , , and ,
|
Label Ranking by Learning Pairwise Preferences (2008), in: Artificial Intelligence, 172:16–17(1897--1916) | , , and ,
[DOI] |