CIRCL/cwe-parent-vulnerability-classification-roberta-base

Browse files

Files changed (5) hide show

README.md +48 -48
config.json +52 -52
emissions.csv +2 -2
metrics.json +6 -6
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -18,9 +18,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.6858
-- Accuracy: 0.6126
-- F1 Macro: 0.3737
 ## Model description
@@ -40,8 +40,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
-- train_batch_size: 32
-- eval_batch_size: 32
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
@@ -51,51 +51,51 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 Macro |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|
-| 3.078         | 1.0   | 237  | 3.0510          | 0.1776   | 0.0529   |
-| 2.4726        | 2.0   | 474  | 2.2886          | 0.4398   | 0.2407   |
-| 2.2031        | 3.0   | 711  | 1.9511          | 0.5185   | 0.3141   |
-| 1.7872        | 4.0   | 948  | 1.7893          | 0.5638   | 0.3511   |
-| 1.4324        | 5.0   | 1185 | 1.7492          | 0.6305   | 0.3805   |
-| 1.2675        | 6.0   | 1422 | 1.6858          | 0.6126   | 0.3737   |
-| 1.0437        | 7.0   | 1659 | 1.7359          | 0.6675   | 0.4296   |
-| 0.8699        | 8.0   | 1896 | 1.7641          | 0.6746   | 0.4246   |
-| 0.8832        | 9.0   | 2133 | 1.8097          | 0.6746   | 0.4444   |
-| 0.8027        | 10.0  | 2370 | 1.8753          | 0.6698   | 0.4380   |
-| 0.4583        | 11.0  | 2607 | 1.8919          | 0.6830   | 0.4473   |
-| 0.5493        | 12.0  | 2844 | 1.8456          | 0.7080   | 0.4915   |
-| 0.4808        | 13.0  | 3081 | 1.9593          | 0.6841   | 0.4555   |
-| 0.4466        | 14.0  | 3318 | 2.0736          | 0.6865   | 0.4454   |
-| 0.2989        | 15.0  | 3555 | 2.1972          | 0.6961   | 0.4474   |
-| 0.255         | 16.0  | 3792 | 2.2513          | 0.7008   | 0.4638   |
-| 0.2474        | 17.0  | 4029 | 2.2991          | 0.7223   | 0.4609   |
-| 0.1648        | 18.0  | 4266 | 2.4582          | 0.7128   | 0.4614   |
-| 0.2112        | 19.0  | 4503 | 2.5944          | 0.7247   | 0.4714   |
-| 0.1185        | 20.0  | 4740 | 2.5292          | 0.7128   | 0.4557   |
-| 0.1453        | 21.0  | 4977 | 2.6173          | 0.7104   | 0.4466   |
-| 0.1126        | 22.0  | 5214 | 2.7072          | 0.7104   | 0.4461   |
-| 0.0872        | 23.0  | 5451 | 2.8997          | 0.7235   | 0.4577   |
-| 0.0768        | 24.0  | 5688 | 2.8199          | 0.7294   | 0.4623   |
-| 0.0643        | 25.0  | 5925 | 2.9228          | 0.7211   | 0.4587   |
-| 0.0828        | 26.0  | 6162 | 3.0185          | 0.7330   | 0.4774   |
-| 0.0407        | 27.0  | 6399 | 3.1037          | 0.7211   | 0.4586   |
-| 0.0386        | 28.0  | 6636 | 3.1938          | 0.7235   | 0.4622   |
-| 0.0321        | 29.0  | 6873 | 3.2786          | 0.7318   | 0.4612   |
-| 0.0189        | 30.0  | 7110 | 3.4453          | 0.7330   | 0.4559   |
-| 0.0223        | 31.0  | 7347 | 3.3558          | 0.7366   | 0.4583   |
-| 0.0255        | 32.0  | 7584 | 3.3787          | 0.7354   | 0.4682   |
-| 0.0123        | 33.0  | 7821 | 3.4288          | 0.7306   | 0.4633   |
-| 0.0128        | 34.0  | 8058 | 3.4361          | 0.7366   | 0.4645   |
-| 0.0201        | 35.0  | 8295 | 3.6213          | 0.7235   | 0.4559   |
-| 0.014         | 36.0  | 8532 | 3.7080          | 0.7247   | 0.4554   |
-| 0.0159        | 37.0  | 8769 | 3.6249          | 0.7330   | 0.4622   |
-| 0.027         | 38.0  | 9006 | 3.6598          | 0.7294   | 0.4604   |
-| 0.0086        | 39.0  | 9243 | 3.7176          | 0.7342   | 0.4637   |
-| 0.0096        | 40.0  | 9480 | 3.7223          | 0.7306   | 0.4614   |
 ### Framework versions
-- Transformers 4.57.1
 - Pytorch 2.9.1+cu128
-- Datasets 4.4.1
-- Tokenizers 0.22.1

 This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.7510
+- Accuracy: 0.5455
+- F1 Macro: 0.3776
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
+- train_batch_size: 64
+- eval_batch_size: 64
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 Macro |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|
+| 3.226         | 1.0   | 125  | 3.1362          | 0.0382   | 0.0035   |
+| 3.0244        | 2.0   | 250  | 2.9390          | 0.2155   | 0.1215   |
+| 2.589         | 3.0   | 375  | 2.3469          | 0.4141   | 0.2521   |
+| 2.1614        | 4.0   | 500  | 2.0701          | 0.4355   | 0.2551   |
+| 1.8396        | 5.0   | 625  | 1.9336          | 0.4467   | 0.2748   |
+| 1.5698        | 6.0   | 750  | 1.9086          | 0.4905   | 0.2938   |
+| 1.4142        | 7.0   | 875  | 1.7933          | 0.5174   | 0.3416   |
+| 1.2292        | 8.0   | 1000 | 1.7510          | 0.5455   | 0.3776   |
+| 1.1182        | 9.0   | 1125 | 1.7681          | 0.5713   | 0.3803   |
+| 0.9924        | 10.0  | 1250 | 1.8151          | 0.6083   | 0.4059   |
+| 0.9307        | 11.0  | 1375 | 1.8391          | 0.6218   | 0.4379   |
+| 0.7875        | 12.0  | 1500 | 1.8065          | 0.6038   | 0.4048   |
+| 0.6308        | 13.0  | 1625 | 1.9221          | 0.6409   | 0.4210   |
+| 0.7327        | 14.0  | 1750 | 1.9986          | 0.6465   | 0.4775   |
+| 0.5175        | 15.0  | 1875 | 2.0520          | 0.6644   | 0.4316   |
+| 0.5302        | 16.0  | 2000 | 2.0989          | 0.6712   | 0.4528   |
+| 0.38          | 17.0  | 2125 | 2.0826          | 0.6734   | 0.4669   |
+| 0.3768        | 18.0  | 2250 | 2.1953          | 0.6611   | 0.4544   |
+| 0.3653        | 19.0  | 2375 | 2.2217          | 0.6880   | 0.5000   |
+| 0.3349        | 20.0  | 2500 | 2.1911          | 0.6880   | 0.4951   |
+| 0.2563        | 21.0  | 2625 | 2.2999          | 0.6813   | 0.4771   |
+| 0.2513        | 22.0  | 2750 | 2.4158          | 0.7037   | 0.4640   |
+| 0.2154        | 23.0  | 2875 | 2.4323          | 0.7138   | 0.4689   |
+| 0.1889        | 24.0  | 3000 | 2.4296          | 0.7037   | 0.4733   |
+| 0.2042        | 25.0  | 3125 | 2.5223          | 0.7071   | 0.4411   |
+| 0.1774        | 26.0  | 3250 | 2.5476          | 0.7037   | 0.5083   |
+| 0.156         | 27.0  | 3375 | 2.5737          | 0.7205   | 0.5236   |
+| 0.1406        | 28.0  | 3500 | 2.6518          | 0.7048   | 0.5220   |
+| 0.144         | 29.0  | 3625 | 2.6388          | 0.7015   | 0.4789   |
+| 0.1119        | 30.0  | 3750 | 2.7159          | 0.7228   | 0.5003   |
+| 0.1187        | 31.0  | 3875 | 2.7170          | 0.7071   | 0.4973   |
+| 0.1095        | 32.0  | 4000 | 2.7796          | 0.7160   | 0.4707   |
+| 0.1082        | 33.0  | 4125 | 2.7926          | 0.7239   | 0.5038   |
+| 0.0976        | 34.0  | 4250 | 2.8240          | 0.7149   | 0.4515   |
+| 0.0885        | 35.0  | 4375 | 2.8532          | 0.7149   | 0.4466   |
+| 0.0872        | 36.0  | 4500 | 2.8697          | 0.7183   | 0.4700   |
+| 0.0795        | 37.0  | 4625 | 2.8467          | 0.7138   | 0.4994   |
+| 0.0878        | 38.0  | 4750 | 2.8566          | 0.7104   | 0.4673   |
+| 0.0886        | 39.0  | 4875 | 2.8951          | 0.7127   | 0.4667   |
+| 0.086         | 40.0  | 5000 | 2.8841          | 0.7127   | 0.4683   |
 ### Framework versions
+- Transformers 4.57.3
 - Pytorch 2.9.1+cu128
+- Datasets 4.4.2
+- Tokenizers 0.22.2

config.json CHANGED Viewed

@@ -11,62 +11,62 @@
   "hidden_dropout_prob": 0.1,
   "hidden_size": 768,
   "id2label": {
-    "0": "LABEL_0",
-    "1": "LABEL_1",
-    "2": "LABEL_2",
-    "3": "LABEL_3",
-    "4": "LABEL_4",
-    "5": "LABEL_5",
-    "6": "LABEL_6",
-    "7": "LABEL_7",
-    "8": "LABEL_8",
-    "9": "LABEL_9",
-    "10": "LABEL_10",
-    "11": "LABEL_11",
-    "12": "LABEL_12",
-    "13": "LABEL_13",
-    "14": "LABEL_14",
-    "15": "LABEL_15",
-    "16": "LABEL_16",
-    "17": "LABEL_17",
-    "18": "LABEL_18",
-    "19": "LABEL_19",
-    "20": "LABEL_20",
-    "21": "LABEL_21",
-    "22": "LABEL_22",
-    "23": "LABEL_23",
-    "24": "LABEL_24",
-    "25": "LABEL_25"
   },
   "initializer_range": 0.02,
   "intermediate_size": 3072,
   "label2id": {
-    "LABEL_0": 0,
-    "LABEL_1": 1,
-    "LABEL_10": 10,
-    "LABEL_11": 11,
-    "LABEL_12": 12,
-    "LABEL_13": 13,
-    "LABEL_14": 14,
-    "LABEL_15": 15,
-    "LABEL_16": 16,
-    "LABEL_17": 17,
-    "LABEL_18": 18,
-    "LABEL_19": 19,
-    "LABEL_2": 2,
-    "LABEL_20": 20,
-    "LABEL_21": 21,
-    "LABEL_22": 22,
-    "LABEL_23": 23,
-    "LABEL_24": 24,
-    "LABEL_25": 25,
-    "LABEL_3": 3,
-    "LABEL_4": 4,
-    "LABEL_5": 5,
-    "LABEL_6": 6,
-    "LABEL_7": 7,
-    "LABEL_8": 8,
-    "LABEL_9": 9
   },
   "layer_norm_eps": 1e-05,
   "max_position_embeddings": 514,

   "hidden_dropout_prob": 0.1,
   "hidden_size": 768,
   "id2label": {
+    "0": "1025",
+    "1": "1071",
+    "2": "131",
+    "3": "138",
+    "4": "284",
+    "5": "285",
+    "6": "435",
+    "7": "436",
+    "8": "595",
+    "9": "657",
+    "10": "664",
+    "11": "682",
+    "12": "684",
+    "13": "691",
+    "14": "693",
+    "15": "697",
+    "16": "703",
+    "17": "706",
+    "18": "707",
+    "19": "710",
+    "20": "74",
+    "21": "754",
+    "22": "829",
+    "23": "862",
+    "24": "913",
+    "25": "94"
   },
   "initializer_range": 0.02,
   "intermediate_size": 3072,
   "label2id": {
+    "1025": 0,
+    "1071": 1,
+    "131": 2,
+    "138": 3,
+    "284": 4,
+    "285": 5,
+    "435": 6,
+    "436": 7,
+    "595": 8,
+    "657": 9,
+    "664": 10,
+    "682": 11,
+    "684": 12,
+    "691": 13,
+    "693": 14,
+    "697": 15,
+    "703": 16,
+    "706": 17,
+    "707": 18,
+    "710": 19,
+    "74": 20,
+    "754": 21,
+    "829": 22,
+    "862": 23,
+    "913": 24,
+    "94": 25
   },
   "layer_norm_eps": 1e-05,
   "max_position_embeddings": 514,

emissions.csv CHANGED Viewed

	@@ -1,2 +1,2 @@
1	- timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,on_cloud,pue
2	- ~~2025~~-11-~~25T09~~:16:23,codecarbon,~~67364f53~~-~~9085~~-~~4f19~~-~~8275~~-~~89bd636a2b29~~,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,~~2029~~.~~9565299919923~~,0.~~08855852421617892~~,4.~~362582297096198e~~-05,42.5,~~357~~.~~51475458776764~~,~~755~~.~~7507977485657~~,0.~~02394362658245421~~,0.~~391652365821642~~,0.~~4257112496104668~~,0.~~8413072420145626~~,Luxembourg,LUX,,,,Linux-6.8.0-88-generic-x86_64-with-glibc2.39,3.12.3,2.~~8.4~~,224,Intel(R) Xeon(R) Platinum 8480+,2,2 x NVIDIA ~~H100 NVL~~,6.1661,49.7498,2015.~~3354606628418~~,machine,N,1.0


1	+ timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,water_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,cpu_utilization_percent,gpu_utilization_percent,ram_utilization_percent,ram_used_gb,on_cloud,pue,wue
2	+ 2026-01-13T21:56:14,codecarbon,393db334-33a7-40a5-8744-5c0c0b8e9695,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,1567.7256346759968,0.06313669868117386,4.02727985590555e-05,368.66620928434594,938.1824499334339,70.0,0.1616492382015093,0.407745279251518,0.030405019717430055,0.5997995371704574,0.0,Luxembourg,LUX,,,,Linux-6.8.0-90-generic-x86_64-with-glibc2.39,3.12.3,3.2.1,224,Intel(R) Xeon(R) Platinum 8480+,4,4 x NVIDIA L40S,6.1661,49.7498,2015.3354835510254,machine,0.6375481386392811,72.2033055198973,4.556418485237484,91.88500246187536,N,1.0,0.0

metrics.json CHANGED Viewed

@@ -1,9 +1,9 @@
 {
-    "eval_loss": 1.6857668161392212,
-    "eval_accuracy": 0.6126340882002383,
-    "eval_f1_macro": 0.37374630603711945,
-    "eval_runtime": 1.9694,
-    "eval_samples_per_second": 426.01,
-    "eval_steps_per_second": 13.71,
     "epoch": 40.0
 }

 {
+    "eval_loss": 1.7509655952453613,
+    "eval_accuracy": 0.5454545454545454,
+    "eval_f1_macro": 0.37756565971323075,
+    "eval_runtime": 1.6377,
+    "eval_samples_per_second": 544.058,
+    "eval_steps_per_second": 8.549,
     "epoch": 40.0
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:353cbf904eb669f05fd973997fe856258a73a984f18c256aad21eab9c211a460
 size 498686648

 version https://git-lfs.github.com/spec/v1
+oid sha256:5c2fe7e35982d93c50286a9784e87bbf66572abaff64e2bd958814eddd688e89
 size 498686648