Keywords:
Publications of Weiwei Cheng
2012
Preference-based Reinforcement Learning: A Formal Framework and a Policy Iteration Algorithm (2012), in: Machine Learning, 89:1-2(123--156) | , , and ,
[DOI] |
2011
Preference-Based Policy Iteration: Leveraging Preference Learning for Reinforcement Learning, in: Proceedings of the 22nd European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2011, Athens, Greece), Part I, pages 312--327, Springer, 2011 | , , and ,
|
2008
Label Ranking by Learning Pairwise Preferences (2008), in: Artificial Intelligence, 172:16–17(1897--1916) | , , and ,
[DOI] |