Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet

Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet

Wed Oct 02 2024 · json · rss

Listen:

Subscribe:

JSON

About

Anthropic, AI, ML, NotebookLM