Hypothesis
Trap-Door Environments for MineRL Agents
Proposal
Add a "change everything" button to a MineRL environment that instantly changes the environment using Stable Diffusion or another fast generative model, so that the resulting change in the agent's learned representations and goal generalization can be observed.
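One way such a button could be wired up is as a wrapper around the environment that starts transforming observations once triggered. The following is a minimal sketch under stated assumptions, not a definitive design: it assumes the change is applied only to the agent's visual observations, that the environment exposes gym-style `reset()` and `step(action)` methods, and that the trigger is a fixed step count. `ToyEnv`, `TrapDoorWrapper`, `invert`, and `trigger_step` are hypothetical names introduced for illustration; `invert` is a trivial placeholder where a call to a generative model would go.

```python
from typing import Callable, List, Tuple

Frame = List[List[int]]  # toy grayscale image: rows of pixel intensities


class ToyEnv:
    """Hypothetical stand-in for a MineRL environment; emits constant frames."""

    def reset(self) -> Frame:
        return [[10, 20], [30, 40]]

    def step(self, action: int) -> Tuple[Frame, float, bool, dict]:
        return [[10, 20], [30, 40]], 0.0, False, {}


class TrapDoorWrapper:
    """Applies `transform` to every observation once the trap door is sprung."""

    def __init__(self, env, transform: Callable[[Frame], Frame], trigger_step: int):
        self.env = env
        self.transform = transform        # assumption: image-to-image function
        self.trigger_step = trigger_step  # assumption: fixed step-count trigger
        self.t = 0

    def reset(self) -> Frame:
        self.t = 0
        return self._observe(self.env.reset())

    def step(self, action: int) -> Tuple[Frame, float, bool, dict]:
        self.t += 1
        obs, reward, done, info = self.env.step(action)
        # Expose the switch in `info` so logged representations can be
        # split into before-change and after-change segments.
        info = dict(info, trap_door_open=self.t >= self.trigger_step)
        return self._observe(obs), reward, done, info

    def _observe(self, obs: Frame) -> Frame:
        return self.transform(obs) if self.t >= self.trigger_step else obs


def invert(frame: Frame) -> Frame:
    """Placeholder for a generative model: inverts pixel intensities."""
    return [[255 - p for p in row] for row in frame]


env = TrapDoorWrapper(ToyEnv(), invert, trigger_step=3)
first = env.reset()
frames = [env.step(0)[0] for _ in range(4)]
```

Here the first two steps return unmodified frames and every frame from the third step onward is transformed. Keeping the transform as a plain callable means the placeholder could be swapped for a generative model without touching the wrapper, and the `trap_door_open` flag marks the switch point for comparing representations on either side of it.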
Reinforcement Learning, Theory, Interpretability & Explainability