- Generalized domains for empirical evaluations in reinforcement learning
- The 4th workshop on Evaluation Methods for Machine Learning, Montreal, Canada
- Book/source title
- Proceedings of the 4th workshop on Evaluation Methods for Machine Learning at ICML-09, Montreal, Canada
- Document type
- Conference contribution
- Faculty of Science (FNWI)
- Informatics Institute (IVI)
Many empirical results in reinforcement learning are based on a very small set of environments. These results often represent the best algorithm parameters that were found after an ad-hoc tuning or fitting process. We argue that presenting tuned scores from a small set of environments leads to method overfitting, wherein results may not generalize to similar environments. To address this problem, we advocate empirical evaluations using generalized domains: parameterized problem generators that explicitly encode variations in the environment to which the learner should be robust. We argue that evaluating across a set of these generated problems offers a more meaningful evaluation of reinforcement learning algorithms.
If you believe that digital publication of certain material infringes any of your rights or (privacy) interests, please let the Library know, stating your reasons. In case of a legitimate complaint, the Library will make the material inaccessible and/or remove it from the website. Please Ask the Library, or send a letter to: Library of the University of Amsterdam, Secretariat, Singel 425, 1012 WP Amsterdam, The Netherlands. You will be contacted as soon as possible.