Effective Altruism Forum
AI benchmarks
Contributors: 4, including MichaelA
You are viewing revision 1.3.0, last edited by MichaelA.
Posts tagged AI benchmarks
Sorted by: Most relevant
35 · Trendlines in AIxBio evals · ljusten · 8d ago · 14m read · 2 · 4
12 · Open Phil releases RFPs on LLM Benchmarks and Forecasting · Lawrence Chan · 1y ago · 0 · 3
193 · Results from an Adversarial Collaboration on AI Risk (FRI) · Forecasting Research Institute, Jhrosenberg, AvitalM, Molly Hickman, rosehadshar · 8mo ago · 11m read · 25 · 2
127 · Announcing Epoch’s dashboard of key trends and figures in Machine Learning · Jaime Sevilla · 2y ago · 4 · 2
124 · Long list of AI questions · NunoSempere, David Mathers🔸, Misha_Yagudin, Gavin · 1y ago · 103m read · 14 · 2
96 · A compute-based framework for thinking about the future of AI · Matthew_Barnett · 1y ago · 23m read · 36 · 2
78 · AI Forecasting Research Ideas · Jaime Sevilla, lennart · 2y ago · 1m read · 1 · 2
59 · Language models surprised us · Ajeya · 1y ago · 6m read · 10 · 2
56 · Prizes for ML Safety Benchmark Ideas · Joshc, Dan H · 2y ago · 1m read · 8 · 2
47 · $250K in Prizes: SafeBench Competition Announcement · Center for AI Safety · 7mo ago · 2m read · 0 · 2
38 · Announcing Epoch's newly expanded Parameters, Compute and Data Trends in Machine Learning database · Robi Rahman, Jaime Sevilla · 1y ago · 1m read · 1 · 2
38 · Survey on the acceleration risks of our new RFPs to study LLM capabilities · Ajeya · 1y ago · 10m read · 1 · 2
37 · XPT forecasts on (some) Direct Approach model inputs · Forecasting Research Institute, rosehadshar · 1y ago · 10m read · 0 · 2
51 · Race to the Top: Benchmarks for AI Safety · isaduan · 2y ago · 2m read · 8 · 1
50 · Announcing the AI Forecasting Benchmark Series | July 8, $120k in Prizes · christian · 5mo ago · 5m read · 4 · 1