Hypothesis
Open

Other models of the same size will replicate the circuits found in the IOI (indirect object identification) interpretability paper

Can you find the IOI capability in other models of the same size, such as OPT small, GPT-Neo small, or the Mistral models?
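
One way to check for the capability before looking for circuits is a logit-difference score: on a prompt like "When Mary and John went to the store, John gave a drink to", compare the next-token logit of the indirect object (" Mary") against the subject (" John"). The sketch below is only an illustration. `next_token_logits` is a hypothetical callable that you would back with whichever model you are testing, and `toy_model` is a made-up stand-in with invented numbers so the snippet runs on its own.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical interface: maps a prompt to next-token logits keyed by token string.
LogitFn = Callable[[str], Dict[str, float]]


def ioi_logit_diff(next_token_logits: LogitFn, prompt: str,
                   indirect_object: str, subject: str) -> float:
    """Logit of the indirect object minus logit of the subject at the next position."""
    logits = next_token_logits(prompt)
    return logits[indirect_object] - logits[subject]


def mean_ioi_logit_diff(next_token_logits: LogitFn,
                        examples: List[Tuple[str, str, str]]) -> float:
    """Average logit difference over (prompt, indirect_object, subject) examples."""
    diffs = [ioi_logit_diff(next_token_logits, p, io, s) for p, io, s in examples]
    return sum(diffs) / len(diffs)


def toy_model(prompt: str) -> Dict[str, float]:
    # Stand-in only: these logits are invented, not from any real model.
    return {" Mary": 2.0, " John": -1.0}


examples = [
    ("When Mary and John went to the store, John gave a drink to", " Mary", " John"),
]
score = mean_ioi_logit_diff(toy_model, examples)
```

A clearly positive mean score across many prompts would suggest the model has the behaviour; a score near zero would suggest it does not, and there would be little point searching for the circuit.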

How similar are the outputs of the Mistral models (GPT-2 Small & Medium, each trained with 5 random seeds) on any given text, and how much do they vary?
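
A possible way to put a number on "similar outputs", assuming you have each seed's next-token probability distribution over a shared vocabulary for the same text: average the Jensen-Shannon divergence over all pairs of seeds. This is one choice of metric among several, not something the idea prescribes, and the function names are made up for this sketch.

```python
import math
from itertools import combinations
from typing import List, Sequence


def js_divergence(p: Sequence[float], q: Sequence[float]) -> float:
    """Jensen-Shannon divergence (in nats) between two distributions of equal length."""
    m = [(a + b) / 2 for a, b in zip(p, q)]

    def kl(x: Sequence[float], y: Sequence[float]) -> float:
        # Terms with zero probability in x contribute nothing to the sum.
        return sum(a * math.log(a / b) for a, b in zip(x, y) if a > 0)

    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def mean_pairwise_js(dists: List[Sequence[float]]) -> float:
    """Mean JS divergence over all unordered pairs of per-seed distributions."""
    pairs = list(combinations(dists, 2))
    return sum(js_divergence(p, q) for p, q in pairs) / len(pairs)
```

The value is 0 when all seeds agree exactly and is bounded above by ln 2 for any pair, so it gives a comparable scale across prompts.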

Relates to the other IOI extension idea.

Deep Learning, Interpretability & Explainability
