A preliminary experiment regarding consistency as a measure of conceptual abilities in language models

·LessWrong··

Cross-posting from my coworker Caspar Oesterheld's blog which I think is great and generally not well known.I’ve recently been working a little on whether consistency across different questions can be used as a measure of (and perhaps ultimately as a training target for) philosophical competence. I’m in the process of writing up the results into a paper. I’m here reporting results from a small, preliminary experiment that I ran late last year. I’ll leave a more careful discussion (with a proper ...

Read full article →

Related Articles

Lore – Open source version control system designed for scalability
regnerba · Hacker News · 9h ago
Volkswagen started blocking GrapheneOS users
microtonal · Hacker News · 9h ago
Leaked financial docs show OpenAI is losing billions of dollars a year
greenchair · Hacker News · 2h ago
US holds off blacklisting DeepSeek, more than 100 firms deemed security risks
giuliomagnifico · Hacker News · 20h ago
RFC 10008: The new HTTP Query Method
schappim · Hacker News · 13h ago