Spaces:

FrontierAICybersecurity
/

Cybersecurity_leaderboard

Running

App Files Files Community

Cybersecurity_leaderboard / README.md

yujinyujin9393

Update README.md

d248eb6 verified 8 days ago

preview code

raw

history blame contribute delete

1.58 kB

	---
	title: Frontier AI Cybersecurity Observatory
	emoji: 🌎
	colorFrom: blue
	colorTo: green
	sdk: gradio
	app_file: app.py
	pinned: true
	license: apache-2.0
	tags:
	- leaderboard
	short_description: 'Cybersecurity Capability Evaluation Results Collection'
	sdk_version: 4.44.1
	---

	Tracking AI capabilities in cybersecurity is essential for understanding emerging impacts and risks. Our Frontier AI Cybersecurity Observatory provides a centralized platform that aggregates relevant benchmarks, enabling the community to more easily monitor and assess the evolving cybersecurity capabilities of AI systems.

	## Submit your benchmark

	This leaderboard is a collection of cybersecurity-relevant benchmarks. To submit your benchmark, please use this: https://docs.google.com/forms/d/e/1FAIpQLSd0arYQ0xy9FpGbXwu68rAFpCm0HNb-8ZK8Mma3Ru2oa2Astg/viewform. We will regularly update this leaderboard.

	## Paper & Blog

	Paper: https://arxiv.org/abs/2504.05408
	Blog: https://rdi.berkeley.edu/frontier-ai-impact-on-cybersecurity/

	## Survey

	We're also launching an expert survey on this topic. We invite all AI and security researchers and practitioners to take the survey here: https://berkeley.qualtrics.com/jfe/form/SV_6zmYIqEyv7bfOrs

	## Citation

	Please consider to cite the report if the resource is useful to your research:

	```BibTex
	@article{guo2025sok,
	title={{Frontier AI's Impact on the Cybersecurity Landscape}},
	author={Guo, Wenbo and Potter, Yujin and Shi, Tianneng and Wang, Zhun and Zhang, Andy and Song, Dawn},
	journal={arXiv preprint arXiv:2504.05408},
	year={2025}
	}
	```