Benchmarks are tests that let us measure progress in AI capabilities and check for characteristics that might pose safety risks.