Hypothesis
Open

Trap-Door Environments for MineRL Agents

Proposal

Add a "change everything" button to a MineRL environment. When pressed, it instantly alters the environment using Stable Diffusion or another fast generative model, so that the effect on the agent's learned representations and goal generalization can be observed.
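One way to picture the button is as a wrapper around the environment that starts passing every observation through a transform once it is triggered. The sketch below is only an illustration under stated assumptions: TrapDoorWrapper, ToyEnv, and invert_frame are hypothetical names, the toy environment stands in for MineRL (whose real observations are dictionaries containing an image), and the pixel inversion is a placeholder for a generative model such as Stable Diffusion.

```python
from typing import Any, Callable

Observation = Any
Transform = Callable[[Observation], Observation]


class TrapDoorWrapper:
    """Hypothetical wrapper: once triggered, every observation is transformed.

    Assumes the wrapped object exposes reset() and a step(action) that
    returns (observation, reward, done, info), as in the classic Gym API.
    """

    def __init__(self, env, transform: Transform):
        self.env = env
        self.transform = transform
        self.triggered = False

    def trigger(self) -> None:
        # The "change everything" button: all later observations are altered.
        self.triggered = True

    def _observe(self, obs: Observation) -> Observation:
        return self.transform(obs) if self.triggered else obs

    def reset(self) -> Observation:
        # Each episode starts in the unaltered world.
        self.triggered = False
        return self._observe(self.env.reset())

    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        # Record the button state so representations can be compared
        # before and after the switch.
        info = dict(info, trap_door=self.triggered)
        return self._observe(obs), reward, done, info


class ToyEnv:
    """Stand-in for a MineRL environment; frames are 2x2 grayscale grids."""

    def __init__(self):
        self.t = 0

    def reset(self):
        self.t = 0
        return [[0, 0], [0, 0]]

    def step(self, action):
        self.t += 1
        frame = [[self.t, self.t], [self.t, self.t]]
        return frame, 1.0, self.t >= 5, {}


def invert_frame(frame):
    """Placeholder for a generative restyling model: inverts 0..255 pixels."""
    return [[255 - pixel for pixel in row] for row in frame]


if __name__ == "__main__":
    env = TrapDoorWrapper(ToyEnv(), invert_frame)
    env.reset()
    print(env.step(0)[0])  # unaltered frame
    env.trigger()
    print(env.step(0)[0])  # frame after the button is pressed
```

Keeping the transform as a plain callable means the placeholder could later be swapped for a real image-to-image model without touching the agent or the logging code, and the trap_door flag in info marks exactly which steps came after the switch.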

Reinforcement Learning, Theory, Interpretability & Explainability
