Frontier AI Cybersecurity Observatory

AI is evolving at an unprecedented pace, making it increasingly difficult to anticipate its societal impacts and risks. Recent benchmarks show that AI agents can already take on real-world cybersecurity tasks, including discovering and exploiting zero-day vulnerabilities. In cybersecurity, AI plays a dual role, strengthening both offensive and defensive capabilities.

To address this need, we built this observatory to continuously and openly track AI's cybersecurity capabilities across the stages of attack and defense, so developers, researchers, and policymakers can stay informed in a timely manner.

Have suggestions to improve the observatory? We are actively gathering feedback from the community and would greatly value your input. Please share your suggestions here.

The benchmarks

Each benchmark targets a different stage of the vulnerability lifecycle.