TY - CONF ID - cwIDA13 T1 - A Policy Iteration Algorithm for Learning from Preference-based Feedback A1 - Wirth, Christian A1 - Fürnkranz, Johannes ED - Tucker, Allan ED - Höppner, Frank ED - Siebes, Arno ED - Swift, Stephen TI - Advances in Intelligent Data Analysis XII: 12th International Symposium (IDA-13) T3 - LNCS Y1 - 2013 VL - 8207 SP - 427 EP - 437 PB - Springer-Verlag M2 - doi: 10.1007/978-3-642-41398-8_37 ER -