End of training

Files changed (2)

README.md +20 -55
emissions.csv +1 -1

README.md CHANGED

@@ -1,68 +1,37 @@
 ---
 library_name: transformers
-license: cc-by-4.0
 base_model: roberta-base
-metrics:
-- accuracy
 tags:
 - generated_from_trainer
-- text-classification
-- classification
-- nlp
-- vulnerability
 model-index:
 - name: vulnerability-severity-classification-roberta-base
   results: []
-datasets:
-- CIRCL/vulnerability-scores
 ---
-# VLAI: A RoBERTa-Based Model for Automated Vulnerability Severity Classification
-# Severity classification
-This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on the dataset [CIRCL/vulnerability-scores](https://huggingface.co/datasets/CIRCL/vulnerability-scores).
-The model was presented in the paper [VLAI: A RoBERTa-Based Model for Automated Vulnerability Severity Classification](https://huggingface.co/papers/2507.03607) [[arXiv](https://arxiv.org/abs/2507.03607)].
-**Abstract:** VLAI is a transformer-based model that predicts software vulnerability severity levels directly from text descriptions. Built on RoBERTa, VLAI is fine-tuned on over 600,000 real-world vulnerabilities and achieves over 82% accuracy in predicting severity categories, enabling faster and more consistent triage ahead of manual CVSS scoring. The model and dataset are open-source and integrated into the Vulnerability-Lookup service.
-You can read [this page](https://www.vulnerability-lookup.org/user-manual/ai/) for more information.
 ## Model description
-It is a classification model and is aimed to assist in classifying vulnerabilities by severity based on their descriptions.
-## How to get started with the model
-```python
-from transformers import AutoModelForSequenceClassification, AutoTokenizer
-import torch
-labels = ["low", "medium", "high", "critical"]
-model_name = "CIRCL/vulnerability-severity-classification-roberta-base"
-tokenizer = AutoTokenizer.from_pretrained(model_name)
-model = AutoModelForSequenceClassification.from_pretrained(model_name)
-model.eval()
-test_description = "SAP NetWeaver Visual Composer Metadata Uploader is not protected with a proper authorization, allowing unauthenticated agent to upload potentially malicious executable binaries \
-that could severely harm the host system. This could significantly affect the confidentiality, integrity, and availability of the targeted system."
-inputs = tokenizer(test_description, return_tensors="pt", truncation=True, padding=True)
-# Run inference
-with torch.no_grad():
-    outputs = model(**inputs)
-    predictions = torch.nn.functional.softmax(outputs.logits, dim=-1)
-# Print results
-print("Predictions:", predictions)
-predicted_class = torch.argmax(predictions, dim=-1).item()
-print("Predicted severity:", labels[predicted_class])
-```
 ## Training procedure
@@ -77,19 +46,15 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: linear
 - num_epochs: 5
-It achieves the following results on the evaluation set:
-- Loss: 0.4942
-- Accuracy: 0.8233
 ### Training results
 | Training Loss | Epoch | Step  | Validation Loss | Accuracy |
 |:-------------:|:-----:|:-----:|:---------------:|:--------:|
-| 0.7037        | 1.0   | 15125 | 0.6343          | 0.7375   |
-| 0.54          | 2.0   | 30250 | 0.5637          | 0.7730   |
-| 0.4651        | 3.0   | 45375 | 0.5438          | 0.7928   |
-| 0.4133        | 4.0   | 60500 | 0.5025          | 0.8131   |
-| 0.3927        | 5.0   | 75625 | 0.4942          | 0.8233   |
 ### Framework versions

 ---
 library_name: transformers
+license: mit
 base_model: roberta-base
 tags:
 - generated_from_trainer
+metrics:
+- accuracy
 model-index:
 - name: vulnerability-severity-classification-roberta-base
   results: []
 ---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# vulnerability-severity-classification-roberta-base
+This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.4941
+- Accuracy: 0.8231
 ## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
 ## Training procedure
 - lr_scheduler_type: linear
 - num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step  | Validation Loss | Accuracy |
 |:-------------:|:-----:|:-----:|:---------------:|:--------:|
+| 0.6002        | 1.0   | 15170 | 0.6387          | 0.7427   |
+| 0.5361        | 2.0   | 30340 | 0.5631          | 0.7751   |
+| 0.4608        | 3.0   | 45510 | 0.5208          | 0.7970   |
+| 0.3383        | 4.0   | 60680 | 0.4975          | 0.8150   |
+| 0.3325        | 5.0   | 75850 | 0.4941          | 0.8231   |
 ### Framework versions

emissions.csv CHANGED

	@@ -1,2 +1,2 @@
1	timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,water_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,cpu_utilization_percent,gpu_utilization_percent,ram_utilization_percent,ram_used_gb,on_cloud,pue,wue
2	- 2026-01-19T10:49:01,codecarbon,e47f1fc5-7153-4bef-b60f-6ecee5cfea74,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,14269.086253645946,0.5400655046401097,3.7848639712449396e-05,364.51917863207274,862.2434481668117,70.0,1.443068396622125,3.410740521090237,0.2768210723136751,5.130629990026026,0.0,Luxembourg,LUX,,,,Linux-6.8.0-90-generic-x86_64-with-glibc2.39,3.12.3,3.2.1,224,Intel(R) Xeon(R) Platinum 8480+,4,4 x NVIDIA L40S,6.1661,49.7498,2015.3354835510254,machine,0.6738405236117954,73.27106059539729,1.0799985924414104,21.291425154725776,N,1.0,0.0


1	timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,water_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,cpu_utilization_percent,gpu_utilization_percent,ram_utilization_percent,ram_used_gb,on_cloud,pue,wue
2	+ 2026-01-23T11:36:20,codecarbon,8125681a-5faa-4b2c-b95b-40fa3f27a3d2,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,14283.482265740167,0.5416504412498306,3.7921455788761915e-05,364.6319145477701,864.5935561562808,70.0,1.4450226164935585,3.4235609066243096,0.2771033872858828,5.145686910403756,0.0,Luxembourg,LUX,,,,Linux-6.8.0-90-generic-x86_64-with-glibc2.39,3.12.3,3.2.1,224,Intel(R) Xeon(R) Platinum 8480+,4,4 x NVIDIA L40S,6.1661,49.7498,2015.3354835510254,machine,0.6635526500773231,73.5131985940246,1.0799100238999018,21.419206692199406,N,1.0,0.0
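
The derived columns in the added `emissions.csv` row can be cross-checked against the raw ones. The sketch below is a minimal illustration, not part of the commit: it assumes `emissions_rate` is `emissions / duration` and `energy_consumed` is the sum of the CPU, GPU, and RAM energy columns, which is how the values in this row appear to relate. The `SAMPLE` string is a hand-trimmed copy of the row that keeps only the columns used here, and `check_row` is a hypothetical helper name.

```python
import csv
import io
import math

# Hand-trimmed copy of the added emissions.csv row (only the columns used below).
SAMPLE = """duration,emissions,emissions_rate,cpu_energy,gpu_energy,ram_energy,energy_consumed
14283.482265740167,0.5416504412498306,3.7921455788761915e-05,1.4450226164935585,3.4235609066243096,0.2771033872858828,5.145686910403756
"""


def check_row(row):
    """Recompute the derived columns of one row (hypothetical helper).

    Assumption: emissions_rate = emissions / duration, and
    energy_consumed = cpu_energy + gpu_energy + ram_energy.
    """
    rate = float(row["emissions"]) / float(row["duration"])
    total_energy = sum(
        float(row[name]) for name in ("cpu_energy", "gpu_energy", "ram_energy")
    )
    return rate, total_energy


row = next(csv.DictReader(io.StringIO(SAMPLE)))
rate, total_energy = check_row(row)

# Compare the recomputed values with the ones reported in the row.
rate_ok = math.isclose(rate, float(row["emissions_rate"]), rel_tol=1e-6)
energy_ok = math.isclose(total_energy, float(row["energy_consumed"]), rel_tol=1e-9)
print("emissions_rate consistent:", rate_ok)
print("energy_consumed consistent:", energy_ok)
```

To run the same check on the full file, pass the path to `open()` and hand the file object to `csv.DictReader` in place of the `io.StringIO` wrapper.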