Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet
Article Status
Published
Title
Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet
Accessed
24/05/2024, 13:28
Citation
Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet. (n.d.). Retrieved May 24, 2024, from https://transformer-circuits.pub/2024/scaling-monosemanticity/index.html
Technical methods
Link to this record