CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models • May 24 • 21
Llama 3.3 Evals Collection: This collection provides detailed information on how we derived the reported benchmark metrics for the Llama 3.3 models, including the configurations • 1 item • Updated 19 days ago • 10
Llama 3.3 Collection: This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated 19 days ago • 92