|
--- |
|
title: Secure Code Leaderboard |
|
emoji: 🥇 |
|
colorFrom: green |
|
colorTo: indigo |
|
sdk: gradio |
|
app_file: app.py |
|
pinned: true |
|
license: apache-2.0 |
|
short_description: Benchmark the ability of LLMs to produce secure code. |
|
--- |
|
|
|
# Dataset Card for stacklok/results |
|
|
|
This dataset contains evaluation results for various models, focusing on security scores and other relevant metrics. |
|
|
|
## Dataset Structure |
|
|
|
The dataset contains the following fields: |
|
- `model_id`: The identifier of the model |
|
- `revision`: The revision or version of the model |
|
- `precision`: The precision used for the model (e.g., fp16, fp32) |
|
- `security_score`: A score representing the model's security evaluation |
|
- `safetensors_compliant`: A boolean indicating whether the model is compliant with safetensors |
|
|
|
## Usage |
|
|
|
This dataset is used to populate the secure code leaderboard, providing insights into the security aspects of various models. |
|
|