transformerlens

Star

Here are 4 public repositories matching this topic...

yash-srivastava19 / arrakis

Sponsor

Star

Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.

transformer garcon explainable-ai mechanistic-interpretability anthropic transformerlens

Updated Mar 6, 2025
Jupyter Notebook

ashioyajotham / exploring_saes

Star

Implementation and analysis of Sparse Autoencoders for neural network interpretability research. Features interactive visualization dashboard and W&B integration.

sparse-autoencoders interpretability activation-functions neuron-activity wandb transformerlens mech-interp

Updated Feb 27, 2025
Python

alexjackson1 / tx

Star

A Flax-based library for examining transformers, based on TransformerLens.

deep-learning transformers flax jax transformerlens

Updated Feb 11, 2024
Python

zilaeric / othello-gpt-probing

Star

Training and exploration of linear probes into Othello-GPT by Li et al. (2022)

probe othello gpt interpretability explainability transformerlens

Updated Jun 29, 2023
Jupyter Notebook

Improve this page

Add a description, image, and links to the transformerlens topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the transformerlens topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

transformerlens

Here are 4 public repositories matching this topic...

yash-srivastava19 / arrakis

ashioyajotham / exploring_saes

alexjackson1 / tx

zilaeric / othello-gpt-probing

Improve this page

Add this topic to your repo