End of training

Browse files

Files changed (4) hide show

README.md +34 -70
config.json +1 -1
emissions.csv +1 -1
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -1,69 +1,50 @@
 ---
 library_name: transformers
-license: cc-by-4.0
 base_model: roberta-base
-metrics:
-- accuracy
 tags:
 - generated_from_trainer
-- text-classification
-- classification
-- nlp
-- vulnerability
 model-index:
 - name: vulnerability-severity-classification-roberta-base
   results: []
-datasets:
-- CIRCL/vulnerability-scores
 ---
-# VLAI: A RoBERTa-Based Model for Automated Vulnerability Severity Classification
-# Severity classification
-This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on the dataset [CIRCL/vulnerability-scores](https://huggingface.co/datasets/CIRCL/vulnerability-scores).
-The model was presented in the paper [VLAI: A RoBERTa-Based Model for Automated Vulnerability Severity Classification](https://huggingface.co/papers/2507.03607) [[arXiv](https://arxiv.org/abs/2507.03607)].
-**Abstract:** VLAI is a transformer-based model that predicts software vulnerability severity levels directly from text descriptions. Built on RoBERTa, VLAI is fine-tuned on over 600,000 real-world vulnerabilities and achieves over 82% accuracy in predicting severity categories, enabling faster and more consistent triage ahead of manual CVSS scoring. The model and dataset are open-source and integrated into the Vulnerability-Lookup service.
-You can read [this page](https://www.vulnerability-lookup.org/user-manual/ai/) for more information.
 ## Model description
-It is a classification model and is aimed to assist in classifying vulnerabilities by severity based on their descriptions.
-## How to get started with the model
-```python
-from transformers import AutoModelForSequenceClassification, AutoTokenizer
-import torch
-labels = ["low", "medium", "high", "critical"]
-model_name = "CIRCL/vulnerability-severity-classification-roberta-base"
-tokenizer = AutoTokenizer.from_pretrained(model_name)
-model = AutoModelForSequenceClassification.from_pretrained(model_name)
-model.eval()
-print("Model revision:", model.config._commit_hash)
-test_description = "SAP NetWeaver Visual Composer Metadata Uploader is not protected with a proper authorization, allowing unauthenticated agent to upload potentially malicious executable binaries \
-that could severely harm the host system. This could significantly affect the confidentiality, integrity, and availability of the targeted system."
-inputs = tokenizer(test_description, return_tensors="pt", truncation=True, padding=True)
-# Run inference
-with torch.no_grad():
-    outputs = model(**inputs)
-    predictions = torch.nn.functional.softmax(outputs.logits, dim=-1)
-# Print results
-print("Predictions:", predictions)
-predicted_class = torch.argmax(predictions, dim=-1).item()
-print("Predicted severity:", labels[predicted_class])
-```
 ## Training procedure
@@ -78,37 +59,20 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: linear
 - num_epochs: 5
-It achieves the following results on the evaluation set:
-- Loss: 1.9981
-- Accuracy: 0.8188
-- F1 Macro: 0.7502
-- Low Precision: 0.6490
-- Low Recall: 0.5113
-- Low F1: 0.572
-- Medium Precision: 0.8488
-- Medium Recall: 0.8681
-- Medium F1: 0.8584
-- High Precision: 0.8148
-- High Recall: 0.8129
-- High F1: 0.8138
-- Critical Precision: 0.7592
-- Critical Recall: 0.7543
-- Critical F1: 0.7567
 ### Training results
 | Training Loss | Epoch | Step  | Validation Loss | Accuracy | F1 Macro | Low Precision | Low Recall | Low F1 | Medium Precision | Medium Recall | Medium F1 | High Precision | High Recall | High F1 | Critical Precision | Critical Recall | Critical F1 |
 |:-------------:|:-----:|:-----:|:---------------:|:--------:|:--------:|:-------------:|:----------:|:------:|:----------------:|:-------------:|:---------:|:--------------:|:-----------:|:-------:|:------------------:|:---------------:|:-----------:|
-| 2.9393        | 1.0   | 16760 | 2.5264          | 0.7363   | 0.6358   | 0.5562        | 0.3500     | 0.4297 | 0.7994           | 0.7985        | 0.7990    | 0.6842         | 0.7790      | 0.7286  | 0.7063             | 0.5008          | 0.5861      |
-| 2.3025        | 2.0   | 33520 | 2.2776          | 0.7710   | 0.6785   | 0.6646        | 0.3369     | 0.4471 | 0.7978           | 0.8607        | 0.8280    | 0.7771         | 0.7286      | 0.7520  | 0.6673             | 0.7073          | 0.6867      |
-| 1.7004        | 3.0   | 50280 | 2.1303          | 0.7911   | 0.7164   | 0.5839        | 0.4895     | 0.5325 | 0.8184           | 0.8623        | 0.8398    | 0.7968         | 0.7644      | 0.7803  | 0.7224             | 0.7041          | 0.7131      |
-| 1.3977        | 4.0   | 67040 | 2.0143          | 0.8087   | 0.7335   | 0.6898        | 0.4418     | 0.5386 | 0.8412           | 0.8622        | 0.8516    | 0.7999         | 0.8029      | 0.8014  | 0.7357             | 0.7489          | 0.7422      |
-| 1.5018        | 5.0   | 83800 | 1.9981          | 0.8188   | 0.7502   | 0.6490        | 0.5113     | 0.572  | 0.8488           | 0.8681        | 0.8584    | 0.8148         | 0.8129      | 0.8138  | 0.7592             | 0.7543          | 0.7567      |
 ### Framework versions
-- Transformers 5.10.2
 - Pytorch 2.12.0+cu130
 - Datasets 4.8.5
 - Tokenizers 0.22.2

 ---
 library_name: transformers
+license: mit
 base_model: roberta-base
 tags:
 - generated_from_trainer
+metrics:
+- accuracy
 model-index:
 - name: vulnerability-severity-classification-roberta-base
   results: []
 ---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# vulnerability-severity-classification-roberta-base
+This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 2.0190
+- Accuracy: 0.8181
+- F1 Macro: 0.7449
+- Low Precision: 0.6507
+- Low Recall: 0.4837
+- Low F1: 0.5549
+- Medium Precision: 0.8435
+- Medium Recall: 0.8746
+- Medium F1: 0.8588
+- High Precision: 0.8174
+- High Recall: 0.8112
+- High F1: 0.8143
+- Critical Precision: 0.7620
+- Critical Recall: 0.7419
+- Critical F1: 0.7518
 ## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
 ## Training procedure
 - lr_scheduler_type: linear
 - num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step  | Validation Loss | Accuracy | F1 Macro | Low Precision | Low Recall | Low F1 | Medium Precision | Medium Recall | Medium F1 | High Precision | High Recall | High F1 | Critical Precision | Critical Recall | Critical F1 |
 |:-------------:|:-----:|:-----:|:---------------:|:--------:|:--------:|:-------------:|:----------:|:------:|:----------------:|:-------------:|:---------:|:--------------:|:-----------:|:-------:|:------------------:|:---------------:|:-----------:|
+| 1.9719        | 1.0   | 16879 | 2.6228          | 0.7372   | 0.6018   | 0.7267        | 0.1845     | 0.2943 | 0.7425           | 0.8797        | 0.8053    | 0.7324         | 0.7008      | 0.7163  | 0.7245             | 0.4996          | 0.5914      |
+| 1.8998        | 2.0   | 33758 | 2.3195          | 0.7712   | 0.6818   | 0.6525        | 0.3581     | 0.4625 | 0.7943           | 0.8589        | 0.8253    | 0.7740         | 0.7409      | 0.7571  | 0.6889             | 0.6756          | 0.6822      |
+| 1.9809        | 3.0   | 50637 | 2.1185          | 0.7922   | 0.7137   | 0.6561        | 0.4318     | 0.5208 | 0.8192           | 0.8621        | 0.8401    | 0.7874         | 0.7779      | 0.7826  | 0.7267             | 0.6962          | 0.7111      |
+| 1.8121        | 4.0   | 67516 | 2.0117          | 0.8098   | 0.7325   | 0.6624        | 0.4442     | 0.5318 | 0.8380           | 0.8675        | 0.8525    | 0.8108         | 0.8004      | 0.8055  | 0.7321             | 0.7483          | 0.7401      |
+| 1.4412        | 5.0   | 84395 | 2.0190          | 0.8181   | 0.7449   | 0.6507        | 0.4837     | 0.5549 | 0.8435           | 0.8746        | 0.8588    | 0.8174         | 0.8112      | 0.8143  | 0.7620             | 0.7419          | 0.7518      |
 ### Framework versions
+- Transformers 5.12.1
 - Pytorch 2.12.0+cu130
 - Datasets 4.8.5
 - Tokenizers 0.22.2

config.json CHANGED Viewed

@@ -34,7 +34,7 @@
   "pad_token_id": 1,
   "problem_type": "single_label_classification",
   "tie_word_embeddings": true,
-  "transformers_version": "5.10.2",
   "type_vocab_size": 1,
   "use_cache": true,
   "vocab_size": 50265

   "pad_token_id": 1,
   "problem_type": "single_label_classification",
   "tie_word_embeddings": true,
+  "transformers_version": "5.12.1",
   "type_vocab_size": 1,
   "use_cache": true,
   "vocab_size": 50265

emissions.csv CHANGED Viewed

	@@ -1,2 +1,2 @@
1	timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,water_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,cpu_utilization_percent,gpu_utilization_percent,ram_utilization_percent,ram_used_gb,on_cloud,pue,wue
2	- 2026-01-~~23T11~~:36:20,~~codecarbon~~,~~8125681a~~-~~5faa~~-~~4b2c~~-~~b95b~~-~~40fa3f27a3d2~~,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,~~14283~~.~~482265740167~~,0.~~5416504412498306~~,3.~~7921455788761915e~~-05,~~364~~.~~6319145477701~~,~~864~~.~~5935561562808~~,70.0,1.~~4450226164935585~~,3.~~4235609066243096~~,0.~~2771033872858828~~,5.~~145686910403756~~,0.0,Luxembourg,LUX,,,,Linux-6.8.0-90-generic-x86_64-with-glibc2.39,3.12.3,3.2.1,224,Intel(R) Xeon(R) Platinum 8480+,4,4 x NVIDIA L40S,6.~~1661~~,49.~~7498~~,2015.~~3354835510254~~,machine,0.~~6635526500773231~~,73.~~5131985940246~~,1.~~0799100238999018~~,21.~~419206692199406~~,N,1.0,0.0


1	timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,water_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,cpu_utilization_percent,gpu_utilization_percent,ram_utilization_percent,ram_used_gb,on_cloud,pue,wue
2	+ 2026-06-16T11:05:44,VulnTrain,0e1d4869-c876-4473-8f9b-413b084ab3dc,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,16266.829393087,0.47444260579462283,2.916626186515789e-05,70.0001854599049,863.9267400007767,70.0,0.3051255646602946,3.8969895462002526,0.3050964050056411,4.507211515866192,0.0,Luxembourg,LUX,luxembourg,,,Linux-6.8.0-124-generic-x86_64-with-glibc2.39,3.12.3,3.2.8,224,Intel(R) Xeon(R) Platinum 8480+,4,4 x NVIDIA L40S,6.1327,49.6098,2015.33540725708,machine,0.9380928853754941,71.58030200098814,1.0,20.170921206709895,N,1.0,0.0

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9db51a05a81c9c2742f4ab7c4bc74483a10d50253498bead84c21974b9a23f8c
 size 498618976

 version https://git-lfs.github.com/spec/v1
+oid sha256:4923900e61732349fa7d3769c6135c3f37b84fdf2adb21d42e457f8b2ea87b0e
 size 498618976