CIRCL/vulnerability-severity-classification-roberta-base

Text Classification

Generated from Trainer

cedricbonhomme committed on Mar 18

Commit dedf577 · verified · 1 Parent(s): 635754e

Update README.md

Files changed (1)

README.md +31 -7

README.md CHANGED

@@ -9,6 +9,8 @@ metrics:
 model-index:
 - name: vulnerability-severity-classification-roberta-base
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -16,22 +18,44 @@ should probably proofread and complete it, then remove this comment. -->
 # vulnerability-severity-classification-roberta-base
-This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.5063
 - Accuracy: 0.8285
 ## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
 ## Training procedure
@@ -62,4 +86,4 @@ The following hyperparameters were used during training:
 - Transformers 4.49.0
 - Pytorch 2.6.0+cu124
 - Datasets 3.4.0
-- Tokenizers 0.21.1

 model-index:
 - name: vulnerability-severity-classification-roberta-base
   results: []
+datasets:
+- CIRCL/vulnerability-scores
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # vulnerability-severity-classification-roberta-base
+This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on the dataset [CIRCL/vulnerability-scores](https://huggingface.co/datasets/CIRCL/vulnerability-scores).
 It achieves the following results on the evaluation set:
 - Loss: 0.5063
 - Accuracy: 0.8285
 ## Model description
+This is a classification model intended to assist in classifying vulnerabilities by severity based on their descriptions.
+## How to get started with the model
+```python
+from transformers import AutoModelForSequenceClassification, AutoTokenizer
+import torch
+labels = ["low", "medium", "high", "critical"]
+model_name = "CIRCL/vulnerability-severity-classification-roberta-base"
+tokenizer = AutoTokenizer.from_pretrained(model_name)
+model = AutoModelForSequenceClassification.from_pretrained(model_name)
+model.eval()
+test_description = "langchain_experimental 0.0.14 allows an attacker to bypass the CVE-2023-36258 fix and execute arbitrary code via the PALChain in the python exec method."
+inputs = tokenizer(test_description, return_tensors="pt", truncation=True, padding=True)
+# Run inference
+with torch.no_grad():
+    outputs = model(**inputs)
+    predictions = torch.nn.functional.softmax(outputs.logits, dim=-1)
+# Print results
+print("Predictions:", predictions)
+predicted_class = torch.argmax(predictions, dim=-1).item()
+print("Predicted severity:", labels[predicted_class])
+```
 ## Training procedure
 - Transformers 4.49.0
 - Pytorch 2.6.0+cu124
 - Datasets 3.4.0
+- Tokenizers 0.21.1
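
The post-processing in the added snippet (softmax over the logits, argmax, then a label lookup) can be illustrated without downloading the model. The following is a minimal sketch in plain Python: the logit values are made up for illustration, the helper names `softmax` and `predict_severity` are hypothetical, and the label order is copied from the snippet on the assumption that it matches the model's class indices.

```python
import math

# Label order copied from the README snippet; assumed to match the model's class indices.
LABELS = ["low", "medium", "high", "critical"]


def softmax(logits):
    """Convert raw logits to probabilities (numerically stable form)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def predict_severity(logits, labels=LABELS):
    """Return the most probable label and its probability."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return labels[best], probs[best]


# Hypothetical logits standing in for `outputs.logits[0]` from the model.
example_logits = [-1.2, 0.3, 2.1, 0.9]
label, confidence = predict_severity(example_logits)
print("Predicted severity:", label)
print("Confidence:", round(confidence, 3))
```

Reporting the probability alongside the label makes it easier to spot low-confidence predictions that may deserve manual review.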