A new benchmark aims to measure and mitigate AI-related dangers.
― 5 min read
Cutting edge science explained simply
A new benchmark aims to measure and mitigate AI-related dangers.
― 5 min read
Learn how sandbagging affects AI assessments and ways to detect it.
― 6 min read