Assert, don’t describe. Linguistic Features that shift LLM reasoning about animal welfare by Jasmine Brazilek

·Nuno Sempere··

tl;drThe way things are said (“lin­guis­tic fea­tures”) in fine-tun­ing data af­fect AI views on an­i­mal welfare. Some fea­tures de­grade com­pas­sion to­wards an­i­mals, some have neg­ligible im­pact, and oth­ers bolster it. We recom­mend you be­come fa­mil­iar with these fea­tures If your writ­ing may end up in LLM fine-tun­ing train­ing data. In short: as­sert a po­si­tion with moral vo­cab­u­lary rather than de­scribe a scene neu­trally, and avoid hedg­ing and overly con­crete sen­sory de­s...

Read full article →

Related Articles

Will Fable be reenabled for Europeans before July 1?
Simon · Manifold Markets · 7h ago
are we locked into P(doom)? by keivn
keivn · Nuno Sempere · 18h ago
Will the biosecurity interventions succeed in a resource-limited setting like Nigeria in the event of an engineered pandemic? by Nnaemeka Emmanuel Nnadi
Nnaemeka Emmanuel Nnadi · Nuno Sempere · 23h ago
How bad would it be if GPS satellites were shot down? by Jackson Wagner
Jackson Wagner · Nuno Sempere · 1d ago
Safe for What World? Why the AI safety field may be asking the wrong question by Benoît L.
Benoît L. · Nuno Sempere · 1d ago