
The Perils of Trial-and-Error Reward Design
Trial-and-error reward design is widespread but unsanctioned, and its implications have not been studied. Through computational experiments and a user study, we find that trial and error leads to overfit and otherwise misdesigned reward functions. Published at AAAI 2023.
Project Webpage