Empirical evaluation methods for multiobjective reinforcement learning algorithms

Peter Vamplew, Richard Dazeley, Adam Berry, Rustam Issabekov, Evan Dekker

Machine Learning (2010)

Abstract

Although a number of algorithms for multiobjective reinforcement learning have been proposed, and a small number of applications developed, there has been very little rigorous empirical evaluation of the performance and limitations of these algorithms. This paper proposes standard methods for such empirical evaluation, to act as a foundation for future comparative studies. Two classes of multiobjective reinforcement learning algorithms are identified, and appropriate evaluation metrics and methodologies are proposed for each class. A suite of benchmark problems with known Pareto fronts is described, and future extensions and implementations of this benchmark suite are discussed. The utility of the proposed evaluation methods is demonstrated via an empirical comparison of two example learning algorithms.

Keywords

Multiobjective reinforcement learning, Multiple objectives, Empirical methods, Pareto fronts, Pareto optimal policies
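
The abstract refers to benchmark problems with known Pareto fronts. As a rough illustration of that concept only, and not of the evaluation metrics the paper itself proposes, the sketch below filters a set of per-policy return vectors down to the non-dominated ones. It assumes every objective is to be maximized; the function names `dominates` and `pareto_front` and the sample return values are hypothetical and do not come from the paper.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, ...]


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """Return True if a Pareto-dominates b (assumption: all objectives are maximized)."""
    at_least_as_good = all(x >= y for x, y in zip(a, b))
    strictly_better = any(x > y for x, y in zip(a, b))
    return at_least_as_good and strictly_better


def pareto_front(points: List[Point]) -> List[Point]:
    """Keep only the points that no other point in the list dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


# Hypothetical two-objective returns, e.g. (reward collected, negative time cost).
returns = [(1.0, -1.0), (2.0, -3.0), (3.0, -5.0), (2.0, -4.0), (1.0, -2.0)]
front = pareto_front(returns)
print(front)
```

Here `(2.0, -4.0)` and `(1.0, -2.0)` are dropped because another point is at least as good on both objectives and strictly better on one. The quadratic scan is chosen for readability and would be slow for large policy sets.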
