The Perils of Trial-and-Error Reward Design
Trial-and-error reward design is an unsanctioned but widespread practice, and its implications have
not been studied. Through computational experiments and a user study, we find that trial and error
leads to reward functions that are overfit and otherwise misdesigned.
Published at AAAI 2023.
Video: Reward Design Perils