Speeding Up JumpReLU SAE Inference with Custom Triton Kernels (2–14× on Real SAEs)

LessWrong

Motivation

Sparse Autoencoders (SAEs) have become a central tool in mechanistic interpretability research, providing a way to decompose a model's internal activations into sparse, interpretable features. However, extracting these features often requires running the SAE over large volumes of activations across many layers and tokens. This makes SAE inference efficiency a practical bottleneck for interpretability research at scale. This post focuses on improving the inference efficiency of JumpReLU...
