Open-ended

Open

Reverse engineering a 1-layer SoLU model

by Sabrina Zaki

How far can you get by deeply reverse engineering a 1-layer SoLU model?

Which directions correspond to features?
Can you find any polysemantic neurons?
Can you fully reverse engineer a feature direction and compare it to a neuron direction?
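
As a starting point for the last question, here is a minimal sketch of how a candidate feature direction could be compared against neuron directions. It assumes SoLU means x * softmax(x) over the MLP hidden dimension and that each neuron's input direction is a column of the MLP input weight matrix. The names `w_in`, `feature_dir`, and `neuron_cosines` are hypothetical, and the weights are random stand-ins rather than values from a trained model.

```python
import numpy as np


def solu(x):
    """SoLU activation: x * softmax(x), with the softmax over the last axis."""
    shifted = x - x.max(axis=-1, keepdims=True)  # subtract the max for numerical stability
    exp = np.exp(shifted)
    return x * exp / exp.sum(axis=-1, keepdims=True)


def neuron_cosines(feature_dir, w_in):
    """Cosine similarity between one feature direction and every neuron's input weights.

    feature_dir: shape (d_model,), a candidate feature direction in the residual stream.
    w_in: shape (d_model, d_mlp); column j is taken to be neuron j's input direction.
    """
    feature_unit = feature_dir / np.linalg.norm(feature_dir)
    neuron_units = w_in / np.linalg.norm(w_in, axis=0, keepdims=True)
    return feature_unit @ neuron_units  # shape (d_mlp,)


# Random stand-in weights (assumption: a real analysis would load trained weights).
rng = np.random.default_rng(0)
d_model, d_mlp = 16, 64
w_in = rng.normal(size=(d_model, d_mlp))

# Hypothetical feature direction: neuron 3's input direction plus a little noise.
feature_dir = w_in[:, 3] + 0.1 * rng.normal(size=d_model)

cosines = neuron_cosines(feature_dir, w_in)
best_neuron = int(np.argmax(np.abs(cosines)))
print(best_neuron, float(cosines[best_neuron]))
```

A feature whose cosine similarity is close to 1 for a single neuron is roughly neuron-aligned. A feature with moderate similarity to many neurons is spread across them, and those neurons are candidates for being polysemantic.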
