Effective Altruism Forum
AI evaluations and standards
• Applied to Meta: Frontier AI Framework (2d ago)
• Applied to Whose track record of AI predictions would you like to see evaluated? (7d ago)
• Applied to AI Audit in Costa Rica (10d ago)
• Applied to Rolling Thresholds for AGI Scaling Regulation (25d ago)
• Applied to o3 (2mo ago)
• Applied to The "low-hanging fruits" of AI safety (2mo ago)
• Applied to I read every major AI lab’s safety plan so you don’t have to (2mo ago)
• Applied to OpenAI's o1 tried to avoid being shut down, and lied about it, in evals (2mo ago)
• Applied to OpenAI's CBRN tests seem unclear (2mo ago)
• Applied to College technical AI safety hackathon retrospective - Georgia Tech (3mo ago)
• Applied to Comparing AI Labs and Pharmaceutical Companies (3mo ago)
• Applied to The current state of RSPs (3mo ago)
• Applied to Trendlines in AIxBio evals (3mo ago)
• Applied to Announcing ForecastBench, a new benchmark for AI and human forecasting abilities (4mo ago)
• Applied to Join the $10K AutoHack 2024 Tournament (4mo ago)
• Applied to Model evals for dangerous capabilities (4mo ago)
• Applied to Submit Your Toughest Questions for Humanity's Last Exam (5mo ago)
• Applied to Thinking About Propensity Evaluations (6mo ago)
• Applied to A Taxonomy Of AI System Evaluations (6mo ago)