Document-tuning instills durable animal compassion in LLMs (and generalizes to humans)

·LessWrong··

Note: This post focuses on the alignment implications. Our EA Forum, focusing on the implications for animal welfare, is here.Jasmine Brazilek & Miles Tidmarsh: Compassion Aligned Machine LearningPreprint, March 2026: Full paper | HuggingFace resources | ANIMA benchmark TL;DRInstruction-tuning and Reinforcement learning are effective in certain specific domains but may produce superficial and/or context-specific alignment towards values like compassion. Pretraining/Midtraining LLMs on synthetic ...

Read full article →

Related Articles

“Beyond the limit”: Satellites and mirrors in space pose threat to the night sky
Breadmaker · Hacker News · 1d ago
Solar rail could become common in Europe after successful trial in Switzerland
neilfrndes · Hacker News · 2h ago
GPT-5.5 Codex reasoning-token clustering may be leading to degraded performance
maille · Hacker News · 19h ago
Potential session/cache leakage between workspace instances or consumer accounts
chatmasta · Hacker News · 1d ago
Show HN: KiCad in the Browser
ViktorEE · Hacker News · 5h ago