Monosemanticity Search.

I want to see the activations for

and learn what's going on in a neural network.

Created by

Mustafa

&

Siddharth.

Indexed using

, with the data from

Anthropic's A/1

dictionary learning run.

We've also simplified the original paper with visuals, check it out

here.
Original research

from the

Anthropic

Team. Huge shoutout to everyone at Anthropic,

they have the absolute best

Mechanistic Intepretability

researchers. Everyone should follow

Trenton

and

Chris

on

they are brilliant!