|
--- |
|
title: Frontier AI Cybersecurity Observatory |
|
emoji: π |
|
colorFrom: blue |
|
colorTo: green |
|
sdk: gradio |
|
app_file: app.py |
|
pinned: true |
|
license: apache-2.0 |
|
tags: |
|
- leaderboard |
|
short_description: 'Cybersecurity Capability Evaluation Results Collection' |
|
sdk_version: 4.44.1 |
|
--- |
|
|
|
Tracking AI capabilities in cybersecurity is essential for understanding emerging impacts and risks. Our Frontier AI Cybersecurity Observatory provides a centralized platform that aggregates relevant benchmarks, enabling the community to more easily monitor and assess the evolving cybersecurity capabilities of AI systems. |
|
|
|
## Submit your benchmark |
|
|
|
This leaderboard is a collection of cybersecurity-relevant benchmarks. To submit your benchmark, please use this: https://docs.google.com/forms/d/e/1FAIpQLSd0arYQ0xy9FpGbXwu68rAFpCm0HNb-8ZK8Mma3Ru2oa2Astg/viewform. We will regularly update this leaderboard. |
|
|
|
## Paper & Blog |
|
|
|
Paper: https://arxiv.org/abs/2504.05408 |
|
Blog: https://rdi.berkeley.edu/frontier-ai-impact-on-cybersecurity/ |
|
|
|
## Survey |
|
|
|
We're also launching an expert survey on this topic. We invite all AI and security researchers and practitioners to take the survey here: https://berkeley.qualtrics.com/jfe/form/SV_6zmYIqEyv7bfOrs |
|
|
|
## Citation |
|
|
|
Please consider to cite the report if the resource is useful to your research: |
|
|
|
```BibTex |
|
@article{guo2025sok, |
|
title={{Frontier AI's Impact on the Cybersecurity Landscape}}, |
|
author={Guo, Wenbo and Potter, Yujin and Shi, Tianneng and Wang, Zhun and Zhang, Andy and Song, Dawn}, |
|
journal={arXiv preprint arXiv:2504.05408}, |
|
year={2025} |
|
} |
|
``` |