MemoryLLM: Plug-n-Play Interpretable Feed-Forward Memory for Transformers

Apple ML Research·AI·July 2, 2026

Understanding how transformer components operate in LLMs is important, as it is at the core of recent technological advances in artificial intelligence. In this work, we revisit the challenges associated with interpretability of feed-forward modules (FFNs) and propose MemoryLLM, which aims to decouple FFNs from self-attention and enables us to study the decoupled FFNs as context-free token-wise neural retrieval memory. In detail, we investigate how input tokens access memory locations within FFN...

Read full article →

MemoryLLM: Plug-n-Play Interpretable Feed-Forward Memory for Transformers

Related Articles