CIRCL/cwe-parent-vulnerability-classification-roberta-base

Browse files

Files changed (5) hide show

README.md +39 -34
config.json +52 -52
emissions.csv +1 -1
metrics.json +7 -7
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -18,9 +18,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.4210
-- Accuracy: 0.5393
-- F1 Macro: 0.2289
 ## Model description
@@ -45,42 +45,47 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 30
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 Macro |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|
-| 3.2216        | 1.0   | 25   | 3.1710          | 0.2921   | 0.0396   |
-| 3.1942        | 2.0   | 50   | 3.0741          | 0.0337   | 0.0041   |
-| 3.1191        | 3.0   | 75   | 3.0549          | 0.0337   | 0.0126   |
-| 3.0134        | 4.0   | 100  | 3.0421          | 0.0225   | 0.0030   |
-| 2.9421        | 5.0   | 125  | 3.0235          | 0.1685   | 0.0372   |
-| 2.9003        | 6.0   | 150  | 3.0508          | 0.0337   | 0.0192   |
-| 2.8848        | 7.0   | 175  | 3.0571          | 0.0337   | 0.0046   |
-| 2.7802        | 8.0   | 200  | 3.0904          | 0.1011   | 0.0441   |
-| 2.7221        | 9.0   | 225  | 3.0587          | 0.1011   | 0.0297   |
-| 2.6257        | 10.0  | 250  | 3.0672          | 0.4045   | 0.0974   |
-| 2.6718        | 11.0  | 275  | 2.9936          | 0.1910   | 0.0680   |
-| 2.4743        | 12.0  | 300  | 2.9018          | 0.3483   | 0.0942   |
-| 2.3287        | 13.0  | 325  | 2.9504          | 0.2472   | 0.0624   |
-| 2.2387        | 14.0  | 350  | 2.8915          | 0.3483   | 0.0974   |
-| 2.2311        | 15.0  | 375  | 2.7952          | 0.2921   | 0.0958   |
-| 2.0399        | 16.0  | 400  | 2.7364          | 0.3371   | 0.1322   |
-| 1.9886        | 17.0  | 425  | 2.8209          | 0.3596   | 0.0969   |
-| 1.9451        | 18.0  | 450  | 2.7339          | 0.3596   | 0.1238   |
-| 1.7988        | 19.0  | 475  | 2.7224          | 0.4045   | 0.1037   |
-| 1.7638        | 20.0  | 500  | 2.6566          | 0.3596   | 0.1215   |
-| 1.7109        | 21.0  | 525  | 2.5994          | 0.4494   | 0.1917   |
-| 1.6364        | 22.0  | 550  | 2.5708          | 0.4382   | 0.1878   |
-| 1.5985        | 23.0  | 575  | 2.5225          | 0.4944   | 0.2005   |
-| 1.5154        | 24.0  | 600  | 2.4922          | 0.4831   | 0.2198   |
-| 1.4841        | 25.0  | 625  | 2.4594          | 0.4944   | 0.2236   |
-| 1.4782        | 26.0  | 650  | 2.4578          | 0.4944   | 0.1985   |
-| 1.4064        | 27.0  | 675  | 2.4545          | 0.5393   | 0.2282   |
-| 1.3746        | 28.0  | 700  | 2.4423          | 0.5169   | 0.2197   |
-| 1.3514        | 29.0  | 725  | 2.4210          | 0.5393   | 0.2289   |
-| 1.34          | 30.0  | 750  | 2.4356          | 0.5281   | 0.2237   |
 ### Framework versions

 This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.0015
+- Accuracy: 0.7126
+- F1 Macro: 0.4115
 ## Model description
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 35
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 Macro |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|
+| 3.2752        | 1.0   | 25   | 3.2654          | 0.0345   | 0.0061   |
+| 3.1821        | 2.0   | 50   | 3.2462          | 0.1494   | 0.0485   |
+| 3.051         | 3.0   | 75   | 3.1660          | 0.0      | 0.0      |
+| 3.0809        | 4.0   | 100  | 3.1639          | 0.2989   | 0.0578   |
+| 2.9999        | 5.0   | 125  | 3.0634          | 0.2759   | 0.0698   |
+| 2.8926        | 6.0   | 150  | 3.0242          | 0.2069   | 0.1097   |
+| 3.0126        | 7.0   | 175  | 2.9642          | 0.1724   | 0.1803   |
+| 2.8108        | 8.0   | 200  | 2.9361          | 0.3218   | 0.1682   |
+| 2.6444        | 9.0   | 225  | 2.8841          | 0.2874   | 0.1558   |
+| 2.5221        | 10.0  | 250  | 2.8314          | 0.3448   | 0.1668   |
+| 2.4355        | 11.0  | 275  | 2.7143          | 0.4253   | 0.1711   |
+| 2.2156        | 12.0  | 300  | 2.7263          | 0.5402   | 0.2043   |
+| 2.1266        | 13.0  | 325  | 2.6320          | 0.5862   | 0.2477   |
+| 2.0063        | 14.0  | 350  | 2.5443          | 0.6092   | 0.2651   |
+| 1.9204        | 15.0  | 375  | 2.5183          | 0.6092   | 0.2626   |
+| 1.718         | 16.0  | 400  | 2.4682          | 0.6437   | 0.2928   |
+| 1.6489        | 17.0  | 425  | 2.4026          | 0.6437   | 0.3107   |
+| 1.5979        | 18.0  | 450  | 2.3305          | 0.6437   | 0.3022   |
+| 1.4923        | 19.0  | 475  | 2.2997          | 0.6322   | 0.2902   |
+| 1.3487        | 20.0  | 500  | 2.2546          | 0.6437   | 0.2980   |
+| 1.3267        | 21.0  | 525  | 2.1921          | 0.6437   | 0.2980   |
+| 1.2326        | 22.0  | 550  | 2.1755          | 0.6552   | 0.3066   |
+| 1.1961        | 23.0  | 575  | 2.1594          | 0.6437   | 0.3053   |
+| 1.0961        | 24.0  | 600  | 2.1266          | 0.6782   | 0.3969   |
+| 1.0278        | 25.0  | 625  | 2.1122          | 0.6897   | 0.3978   |
+| 1.0279        | 26.0  | 650  | 2.0835          | 0.6897   | 0.3978   |
+| 0.988         | 27.0  | 675  | 2.0699          | 0.6782   | 0.3927   |
+| 0.9298        | 28.0  | 700  | 2.0440          | 0.7126   | 0.4073   |
+| 0.9014        | 29.0  | 725  | 2.0194          | 0.7011   | 0.4042   |
+| 0.877         | 30.0  | 750  | 2.0455          | 0.7011   | 0.4026   |
+| 0.8503        | 31.0  | 775  | 2.0098          | 0.7126   | 0.4073   |
+| 0.8187        | 32.0  | 800  | 2.0033          | 0.7126   | 0.4115   |
+| 0.7948        | 33.0  | 825  | 2.0046          | 0.7126   | 0.4073   |
+| 0.8645        | 34.0  | 850  | 2.0064          | 0.7126   | 0.4115   |
+| 0.7605        | 35.0  | 875  | 2.0015          | 0.7126   | 0.4115   |
 ### Framework versions

config.json CHANGED Viewed

@@ -10,62 +10,62 @@
   "hidden_dropout_prob": 0.1,
   "hidden_size": 768,
   "id2label": {
-    "0": "LABEL_0",
-    "1": "LABEL_1",
-    "2": "LABEL_2",
-    "3": "LABEL_3",
-    "4": "LABEL_4",
-    "5": "LABEL_5",
-    "6": "LABEL_6",
-    "7": "LABEL_7",
-    "8": "LABEL_8",
-    "9": "LABEL_9",
-    "10": "LABEL_10",
-    "11": "LABEL_11",
-    "12": "LABEL_12",
-    "13": "LABEL_13",
-    "14": "LABEL_14",
-    "15": "LABEL_15",
-    "16": "LABEL_16",
-    "17": "LABEL_17",
-    "18": "LABEL_18",
-    "19": "LABEL_19",
-    "20": "LABEL_20",
-    "21": "LABEL_21",
-    "22": "LABEL_22",
-    "23": "LABEL_23",
-    "24": "LABEL_24",
-    "25": "LABEL_25"
   },
   "initializer_range": 0.02,
   "intermediate_size": 3072,
   "label2id": {
-    "LABEL_0": 0,
-    "LABEL_1": 1,
-    "LABEL_10": 10,
-    "LABEL_11": 11,
-    "LABEL_12": 12,
-    "LABEL_13": 13,
-    "LABEL_14": 14,
-    "LABEL_15": 15,
-    "LABEL_16": 16,
-    "LABEL_17": 17,
-    "LABEL_18": 18,
-    "LABEL_19": 19,
-    "LABEL_2": 2,
-    "LABEL_20": 20,
-    "LABEL_21": 21,
-    "LABEL_22": 22,
-    "LABEL_23": 23,
-    "LABEL_24": 24,
-    "LABEL_25": 25,
-    "LABEL_3": 3,
-    "LABEL_4": 4,
-    "LABEL_5": 5,
-    "LABEL_6": 6,
-    "LABEL_7": 7,
-    "LABEL_8": 8,
-    "LABEL_9": 9
   },
   "layer_norm_eps": 1e-05,
   "max_position_embeddings": 514,

   "hidden_dropout_prob": 0.1,
   "hidden_size": 768,
   "id2label": {
+    "0": "1025",
+    "1": "1071",
+    "2": "131",
+    "3": "138",
+    "4": "284",
+    "5": "285",
+    "6": "435",
+    "7": "436",
+    "8": "595",
+    "9": "657",
+    "10": "664",
+    "11": "682",
+    "12": "684",
+    "13": "691",
+    "14": "693",
+    "15": "697",
+    "16": "703",
+    "17": "706",
+    "18": "707",
+    "19": "710",
+    "20": "74",
+    "21": "754",
+    "22": "829",
+    "23": "862",
+    "24": "913",
+    "25": "94"
   },
   "initializer_range": 0.02,
   "intermediate_size": 3072,
   "label2id": {
+    "1025": 0,
+    "1071": 1,
+    "131": 2,
+    "138": 3,
+    "284": 4,
+    "285": 5,
+    "435": 6,
+    "436": 7,
+    "595": 8,
+    "657": 9,
+    "664": 10,
+    "682": 11,
+    "684": 12,
+    "691": 13,
+    "693": 14,
+    "697": 15,
+    "703": 16,
+    "706": 17,
+    "707": 18,
+    "710": 19,
+    "74": 20,
+    "754": 21,
+    "829": 22,
+    "862": 23,
+    "913": 24,
+    "94": 25
   },
   "layer_norm_eps": 1e-05,
   "max_position_embeddings": 514,

emissions.csv CHANGED Viewed

	@@ -1,2 +1,2 @@
1	timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,on_cloud,pue
2	- 2025-09-~~02T08~~:50:21,codecarbon,~~2664b472~~-~~8e38~~-~~43b8~~-~~ae2d~~-~~497b80d15f92~~,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,~~371~~.~~1448491369374~~,0.~~0064907567902602005~~,1.~~748847331534803e~~-05,42.5,~~479~~.~~247001834791~~,94.34468507766725,0.~~004378942286911964~~,0.~~04756333749507746~~,0.~~009720002218905472~~,0.~~0616622820008949~~,Luxembourg,LUX,luxembourg,,,Linux-6.8.0-71-generic-x86_64-with-glibc2.39,3.12.3,2.8.4,64,AMD EPYC 9124 16-Core Processor,2,2 x NVIDIA L40S,6.1294,49.6113,251.5858268737793,machine,N,1.0


1	timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,on_cloud,pue
2	+ 2025-09-02T09:11:46,codecarbon,6d873b53-faff-4d84-8a8c-df5761d6e266,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,335.906803624006,0.00551183463172775,1.6408821054715427e-05,42.5,424.2278887662371,94.34468507766725,0.00396303443194726,0.03960241251522234,0.008797060598005339,0.05236250754517494,Luxembourg,LUX,luxembourg,,,Linux-6.8.0-71-generic-x86_64-with-glibc2.39,3.12.3,2.8.4,64,AMD EPYC 9124 16-Core Processor,2,2 x NVIDIA L40S,6.1294,49.6113,251.5858268737793,machine,N,1.0

metrics.json CHANGED Viewed

@@ -1,9 +1,9 @@
 {
-    "eval_loss": 2.421022653579712,
-    "eval_accuracy": 0.5393258426966292,
-    "eval_f1_macro": 0.22885547452822638,
-    "eval_runtime": 0.5079,
-    "eval_samples_per_second": 175.234,
-    "eval_steps_per_second": 5.907,
-    "epoch": 30.0
 }

 {
+    "eval_loss": 2.001497268676758,
+    "eval_accuracy": 0.7126436781609196,
+    "eval_f1_macro": 0.4114671800348193,
+    "eval_runtime": 0.2929,
+    "eval_samples_per_second": 297.009,
+    "eval_steps_per_second": 10.242,
+    "epoch": 35.0
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5c108edcca3e3bcbaf2bb5d1641ef84bfe86352fc320cc1e5f284b42569ffa3f
 size 498686648

 version https://git-lfs.github.com/spec/v1
+oid sha256:d1d7658a2c382c9341197d2c3b8aee399bc881d3cb32dc42076c6b5ce9d20b4d
 size 498686648