Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet

About

Anthropic, AI, ML, NotebookLM