AI benchmarks

Applied to Trendlines in AIxBio evals ago
Applied to Long list of AI questions ago
Applied to Language models surprised us ago

Benchmarks are tests which enable us to measure the progress of AI capabilities, and test for characteristics which might pose safety risks.