Speeding Up JumpReLU SAE Inference with Custom Triton Kernels (2–14× on Real SAEs)

Daniel Tiourine·LessWrong·Community·June 14, 2026

MotivationSparse Autoencoders (SAEs) have become a central tool in mechanistic interpretability research, providing a way to decompose a model's internal activations into sparse, interpretable features. However, extracting these features often requires running the SAE over large volumes of activations across many layers and tokens. This makes SAE inference efficiency a practical bottleneck for interpretability research at scale. This post focuses on improving the inference efficiency of JumpReLU...

Read full article →

Speeding Up JumpReLU SAE Inference with Custom Triton Kernels (2–14× on Real SAEs)

Related Articles