Other models of the same size as GPT-2 small will replicate the results of the IOI circuit interpretability paper

Can you find the IOI (indirect object identification) capability in other models of the same size, such as OPT small, Neo small, or the Mistral models?
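One way to start on this is to check whether a model prefers the indirect object over the subject at the end of an IOI prompt. The sketch below is a minimal version of that check, not the method from the paper: it assumes Hugging Face `transformers` and `torch` are installed, that each name maps to a single leading-space token, and that the model identifiers you pass (for example `gpt2` or `facebook/opt-125m`) are the ones you want to compare. The helper names `ioi_logit_diff` and `mean_ioi_logit_diff` are made up for illustration.

```python
def ioi_logit_diff(final_logits, io_token_id, s_token_id):
    """Logit of the indirect-object name minus logit of the subject name."""
    return final_logits[io_token_id] - final_logits[s_token_id]


def mean_ioi_logit_diff(model_name, prompts):
    """Average logit difference over (text, io_name, s_name) prompts.

    Heavy imports live inside the function so the pure helper above can be
    used without transformers/torch installed.
    """
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tok = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(model_name)
    model.eval()

    diffs = []
    for text, io_name, s_name in prompts:
        enc = tok(text, return_tensors="pt")
        with torch.no_grad():
            # Logits at the last position predict the next token.
            final_logits = model(**enc).logits[0, -1].tolist()
        # Assumption: " Name" is a single token; only the first sub-token is used.
        io_id = tok.encode(" " + io_name, add_special_tokens=False)[0]
        s_id = tok.encode(" " + s_name, add_special_tokens=False)[0]
        diffs.append(ioi_logit_diff(final_logits, io_id, s_id))
    return sum(diffs) / len(diffs)


# Example prompt in the usual IOI format: the correct completion is " Mary".
EXAMPLE_PROMPTS = [
    ("When Mary and John went to the store, John gave a drink to", "Mary", "John"),
]

if __name__ == "__main__":
    for name in ["gpt2", "facebook/opt-125m"]:
        print(name, mean_ioi_logit_diff(name, EXAMPLE_PROMPTS))
```

A clearly positive average would suggest the model has the behaviour at all; whether it uses a similar circuit is a separate question that needs patching or ablation experiments on top of this.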

How similar are the outputs of the Mistral models (GPT-2 Small and Medium, each trained with 5 random seeds) on any given text, and how much do they vary?
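One possible way to quantify "similar outputs" is to compare the next-token distributions of the different seeds at the same position and average a symmetric divergence over all pairs of seeds. The sketch below uses Jensen-Shannon divergence in plain Python; the choice of metric and the function names are assumptions made for illustration, and the logits would come from running each seed's checkpoint on the same text.

```python
import math
from itertools import combinations


def softmax(logits):
    """Convert a list of logits into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl(p, q):
    """KL(p || q) in nats; terms with p_i == 0 contribute nothing."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def js_divergence(p, q):
    """Jensen-Shannon divergence: symmetric, bounded by log(2)."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def mean_pairwise_js(dists):
    """Average JS divergence over every pair of seeds' distributions."""
    pairs = list(combinations(dists, 2))
    return sum(js_divergence(p, q) for p, q in pairs) / len(pairs)


if __name__ == "__main__":
    # Toy logits standing in for three seeds at one token position.
    seed_logits = [[2.0, 1.0, 0.1], [1.9, 1.1, 0.0], [0.2, 2.5, 0.3]]
    print(mean_pairwise_js([softmax(l) for l in seed_logits]))
```

Values near 0 mean the seeds agree at that position, and values near log(2) mean they put their probability mass on different tokens. Averaging over many positions and texts would give a rough picture of how much the seeds vary.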

Deep Learning, Interpretability & Explainability
