An Invisible Issue of Task Underspecification in Deep Reinforcement Learning Evaluations

Performance evaluations of Deep Reinforcement Learning (DRL) algorithms are an integral part of the scientific progress of the field. However, standard performance evaluation practices in evaluating algorithmic generalization of DRL methods within a task can be unreliable and misleading if not caref...

Full description

Bibliographic Details
Main Author: Jayawardana, Vindula Muthushan
Other Authors: Wu, Cathy
Format: Thesis
Published: Massachusetts Institute of Technology 2023
Online Access:https://hdl.handle.net/1721.1/147493