Exploration: fine-tuning with parameter decomposition

LessWrong

TL;DR: We can destroy a 67M-parameter language model's ability to predict German text by fine-tuning a single number: the scalar prefactor on one German-related rank-1 parameter subcomponent. This is an early exploration into using parameter decomposition for a more targeted and interpretable form of model fine-tuning. At small German-token budgets, fine-tuning the scalar prefactor of a single German-related parameter subcomponent beats rank-1 and rank-4 LoRA [1] fine-tunes on the trade-off betw...
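To make the idea of "fine-tuning a single number" concrete, here is a minimal toy sketch, not the article's actual setup. It assumes a weight matrix written as a sum of rank-1 subcomponents, W = sum_c s_c * u_c v_c^T, and runs gradient descent on one scalar prefactor s_k while everything else stays frozen. The function names (`apply_weight`, `tune_single_scale`), the 2x2 toy weight, the squared-error loss, and the hyperparameters are all hypothetical and chosen only for illustration.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def apply_weight(scales, us, vs, x):
    """Compute y = sum_c scales[c] * u_c * (v_c . x) without forming W."""
    y = [0.0] * len(us[0])
    for s, u, v in zip(scales, us, vs):
        coeff = s * dot(v, x)
        y = [yi + coeff * ui for yi, ui in zip(y, u)]
    return y

def tune_single_scale(scales, us, vs, data, k, lr=0.1, steps=200):
    """Gradient descent on scales[k] only; every other parameter is frozen."""
    scales = list(scales)
    for _ in range(steps):
        grad = 0.0
        for x, target in data:
            y = apply_weight(scales, us, vs, x)
            err = [yi - ti for yi, ti in zip(y, target)]
            # d/ds_k of ||y - target||^2 = 2 * (err . u_k) * (v_k . x)
            grad += 2.0 * dot(err, us[k]) * dot(vs[k], x)
        scales[k] -= lr * grad / len(data)
    return scales

# Toy 2x2 weight with two orthonormal rank-1 subcomponents (an assumption).
us = [[1.0, 0.0], [0.0, 1.0]]
vs = [[1.0, 0.0], [0.0, 1.0]]
scales = [1.0, 1.0]

# Targets keep subcomponent 0's output and ask for zero output from subcomponent 1.
data = [([1.0, 0.0], [1.0, 0.0]), ([0.0, 1.0], [0.0, 0.0])]
tuned = tune_single_scale(scales, us, vs, data, k=1)
```

In this toy case only `scales[1]` moves (it decays toward zero), so the behaviour carried by subcomponent 0 is untouched. A rank-1 LoRA update, by contrast, learns a new pair of vectors rather than rescaling an existing, already-interpreted direction.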
