Agent, Know Thyself! (and bid accordingly)

Strange Loop Canon

Written with the wonderful Andrey Fradkin, who does the Justified Posteriors podcast.

Attention conservation notice: We developed a new benchmark, MarketBench, and scaffold. Based on our findings, we argue that self-assessment of capabilities and costs is a key capability, and it needs to be a target of training. This is work in progress, and we are looking for collaborators and funding to pursue this research. Paper here. Repo here.

Let’s say you have a large-scale project to work on. How do you ...

