
How capable will top AI models be in 2025? 

Forecast LLM agents' autonomous replication and adaptation (ARA) abilities, and model performance on benchmarks such as GPQA and GAIA, in AI Benchmarks, a collaboration with the AI Safety Student Team at Harvard (AISST).

Start here

The AISST questions are inspired by @elifland's work.
