Recursive forecasting

·Redwood Research··

We’d like to use powerful AIs to answer questions that may take a long time to resolve. But if a model only cares about performing well in ways that are verifiable shortly after answering (e.g., a myopic fitness seeker), it may be difficult to get useful work from it on questions that resolve much later.In this post, I’ll describe a proposal for eliciting good long-horizon forecasts from these models. Instead of asking a model to directly predict a far-future outcome, we can recursively:Ask it t...

Read full article →

Related Articles

The Case for Evaluating Model Behaviors
jsteinhardt · Alignment Forum · 21h ago
Mechanistic estimation for expectations of random products
Jacob Hilton · ARC · 5d ago
Multipolar Civilisation Depends on Maintaining an Attacker’s Dilemma
Naci Cankaya · LessWrong · 14d ago
Blind deep-deployment evals for control & sabotage
Dylan Bowman · LessWrong · 14d ago
Survival over Scrutiny: Mapping the Breakdown of Constitutional Alignment
Nishka Katoch · LessWrong · 14d ago