Hypothesis
Increasing the strength of ablation on a Transformer attention head will gradually activate its backup heads.
In "Interpretability in the Wild", the backup name mover heads activate when the name mover heads are ablated. How should we expect the backup name mover heads to respond when the main name mover head is ablated at different strengths, rather than fully?
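One way to make "different strengths of ablation" concrete is to interpolate between a head's clean output and a baseline. The sketch below is only an illustration under assumptions that are not in the original note: the function name `partial_ablate`, the strength parameter `alpha`, the zero baseline, and the toy output vector are all hypothetical, and a real experiment would apply the same interpolation to the head's output inside the model.

```python
def partial_ablate(head_out, baseline, alpha):
    """Blend a head's output with a baseline.

    alpha = 0.0 leaves the head untouched; alpha = 1.0 replaces its
    output with the baseline entirely (full ablation).
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return [(1.0 - alpha) * h + alpha * b for h, b in zip(head_out, baseline)]


# Toy stand-in for one head's output vector (hypothetical values).
head_out = [2.0, -1.0, 0.5, 4.0]
# Zero ablation is assumed here; mean ablation would use the head's
# average output over a reference dataset as the baseline instead.
baseline = [0.0] * len(head_out)

# Sweep the ablation strength from none to full in five steps.
alphas = [i / 4 for i in range(5)]
sweep = {alpha: partial_ablate(head_out, baseline, alpha) for alpha in alphas}
```

In an actual run, each entry of `sweep` would be patched into the forward pass, and the backup heads' contribution to the output would be recorded for that value of `alpha`.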
Two outcomes seem plausible: either the backup heads activate gradually as the ablation strength increases, or their behaviour undergoes a sharp phase shift at some threshold. Also see the work on backup backup name mover heads.
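A simple, hedged way to tell the two outcomes apart is to ask how much of the total change in backup-head activity happens in a single step of the ablation sweep. The metric below (`largest_jump_fraction`) is a hypothetical heuristic, not a method from the cited paper, and the two response curves are synthetic shapes chosen to illustrate the extremes, not measured results.

```python
def largest_jump_fraction(responses):
    """Share of the total step-to-step change carried by the largest step.

    Values near 1 / (number of steps) suggest a gradual response;
    values near 1.0 suggest an abrupt, phase-shift-like response.
    """
    steps = [abs(b - a) for a, b in zip(responses, responses[1:])]
    total = sum(steps)
    if total == 0:
        return 0.0  # flat curve: the backup heads never responded
    return max(steps) / total


# Synthetic backup-head responses at five increasing ablation strengths.
gradual = [0.0, 0.25, 0.5, 0.75, 1.0]  # illustrative linear ramp
abrupt = [0.0, 0.0, 0.0, 1.0, 1.0]     # illustrative step function

gradual_score = largest_jump_fraction(gradual)
abrupt_score = largest_jump_fraction(abrupt)
```

Here the ramp spreads its change evenly over four steps while the step function concentrates all of it in one, so the second score is higher. With real measurements the sweep would need enough ablation strengths for the distinction to be meaningful.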