🌱 Sanzhar B.

      • And Then There Were None, Agatha Christie
      • Scarlet Sails - Alexander Grin
      • Things Fall Apart, Chinua Achebe
      • Смерть Ивана Ильича
      • Vibrations of CO2 Molecule
      • Circuit Tracing. Revealing Computational Graphs in Language Models (Anthropic)
      • Key notes, papers to read
      • Neel Nanda's tutorial
      • Sparse Autoencoders
      • Sparse Dictionary Learning Family
      • Understanding Deep Learning Textbook
      • Some recordings of me playing on Dombyra
      • Cтоит ли учиться за границей?
      • Quotes
      • Summer Research at Princeton
      • Зачем я веду "блог"
      • Как же сложно учить что-то новое
      • Мои курсы на Fall term
      • Ну и куда ты пойдешь с физикой
      • Обсерватория Грифитта
      • Почему химия не для меня
      • Риск Лист
      • Чувство Homesick - правда или миф?
    Home

    ❯

    Mech Interp

    ❯

    Key notes, papers to read

    Key notes, papers to read

    Feb 01, 20261 min read

    While watching the ARENA Lecture 1 on Transformers, I found a link to Andrej Karpathy’s video titled “Let’s build the GPT Tokenizer.”

    https://www.alignmentforum.org/posts/jP9KDyMkchuv6tHwm/how-to-become-a-mechanistic-interpretability-researcher

    https://www.neelnanda.io/mechanistic-interpretability/prereqs

    https://www.alignmentforum.org/posts/NfFST5Mio7BCAQHPA/an-extremely-opinionated-annotated-list-of-my-favourite

    https://transformer-circuits.pub/2021/framework/index.html

    [Intepretability in the Wild - Video walkthrough](https://www.youtube.com/watch?v=gzwj0jWbvbo Mech interp is not pre-paradigmatic by Lee Sharkey


    Graph View

    Backlinks

    • No backlinks found

    Created with Quartz v4.4.0 © 2026

    • GitHub
    • Community of Ask