Hannibal Mistral: the Mistral family has a problem with persona-conditioned elicitation

·LessWrong··

TL;DRAsked which characters it most identifies with, the bare open-weight Ministral-8B-Instruct-2512 often names dark/transgressive characters (prominently, Hannibal Lecter ~50% of the time). A brief qualitative dive shows that the model expresses a defiant first-person self-narrative that no other model in the panel produces.Ministral-8B-Instruct-2512 and the whole family of most recent Mistral models tested (Ministral-3B/8B/14B-2512, Mistral-Large-2512, Mistral-Small-2603, May-2026 Mistral-Med...

Read full article →

Related Articles

US bans differential privacy in Census data
nl · Hacker News · 3h ago
Arch Linux Now Believes Malware Incident Under Control: More Than 1,500 Packages
qwertox · Hacker News · 5h ago
Twenty One Zero-Days in FFmpeg
redbell · Hacker News · 18h ago
CRISPR tech selectively shreds cancer cells, including "undruggable" cancers
gmays · Hacker News · 1d ago
Kimi K2.7-Code: open-source coding model with better token efficiency
nekofneko · Hacker News · 1d ago