Agent, Know Thyself! (and bid accordingly)
Written with the wonderful Andrey Fradkin, who does the Justified Posteriors podcast.Attention conservation notice: We developed a new benchmark, MarketBench, and scaffold. Based on our findings, we argue that self-assessment of capabilities and costs is a key capability, and it needs to be a target of training. This is work in progress, and we are looking for collaborators and funding to pursue this research. Paper here. Repo here.Let’s say you have a large-scale project to work on. How do you ...
Read full article →