A Black Box Made Less Opaque (part 4)

Matthew McDonnell·LessWrong·Community·July 1, 2026

Understanding the effects of compression on model performance and interpretability

I. Executive summary

This is the fourth installment in a series of analyses exploring basic AI interpretability mechanics and techniques. While this analysis is designed to stand on its own, readers interested in a comparative analysis of representational geometry and the effects of manipulating feature activation will likely appreciate a review of part 1, part 2, and part 3 of this series.

Key findings:

Context: This...
