Gonçalo Teixeira
Writing
Index
About
|
PT
Index
›
People
Person
Adly Templeton
Anthropic
Papers authored
Towards Monosemanticity: Decomposing Language Models With Dictionary Learning
October 4, 2023
Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet
May 21, 2024
Essays referencing this
Opening the Black Box
April 25, 2026