End of training

Files changed (3) hide show

README.md CHANGED Viewed

@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5008
-- Accuracy: 0.8319
 ## Model description
@@ -44,18 +44,19 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 6
 ### Training results
-| Training Loss | Epoch | Step  | Validation Loss | Accuracy |
-|:-------------:|:-----:|:-----:|:---------------:|:--------:|
-| 0.6757        | 1.0   | 14844 | 0.6233          | 0.7403   |
-| 0.5788        | 2.0   | 29688 | 0.5779          | 0.7662   |
-| 0.4837        | 3.0   | 44532 | 0.5358          | 0.7866   |
-| 0.5331        | 4.0   | 59376 | 0.4985          | 0.8092   |
-| 0.3582        | 5.0   | 74220 | 0.4885          | 0.8257   |
-| 0.3038        | 6.0   | 89064 | 0.5008          | 0.8319   |
 ### Framework versions

 This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4995
+- Accuracy: 0.8421
 ## Model description
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 7
 ### Training results
+| Training Loss | Epoch | Step   | Validation Loss | Accuracy |
+|:-------------:|:-----:|:------:|:---------------:|:--------:|
+| 0.6798        | 1.0   | 14844  | 0.6302          | 0.7400   |
+| 0.5937        | 2.0   | 29688  | 0.6037          | 0.7617   |
+| 0.5045        | 3.0   | 44532  | 0.5406          | 0.7846   |
+| 0.5463        | 4.0   | 59376  | 0.4999          | 0.8103   |
+| 0.3192        | 5.0   | 74220  | 0.4894          | 0.8257   |
+| 0.2919        | 6.0   | 89064  | 0.4923          | 0.8384   |
+| 0.3553        | 7.0   | 103908 | 0.4995          | 0.8421   |
 ### Framework versions

emissions.csv CHANGED Viewed

	@@ -1,2 +1,2 @@
1	timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,on_cloud,pue
2	- 2025-12-~~15T11~~:34:11,codecarbon,~~90df3613~~-~~3011~~-~~4a1a~~-~~a356~~-~~4bdeebabc187~~,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,~~17918~~.~~7004566479~~,0.~~7681369216126306~~,4.~~286789231568683e~~-05,42.5,~~213~~.~~13512198891044~~,755.7507977485657,0.~~21137087100180996~~,3.~~3277749330511597~~,3.~~7581658972345817~~,7.~~2973117012875415~~,Luxembourg,LUX,,,,Linux-6.8.0-88-generic-x86_64-with-glibc2.39,3.12.3,2.8.4,224,Intel(R) Xeon(R) Platinum 8480+,2,2 x NVIDIA H100 NVL,6.1661,49.7498,2015.3354606628418,machine,N,1.0


1	timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,on_cloud,pue
2	+ 2025-12-16T12:21:26,codecarbon,2f52b7fd-7978-4654-8d58-6102e50694c2,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,20689.402613584185,0.8893559856050955,4.2986064035564027e-05,42.5,212.93509257980364,755.7507977485657,0.24405337297347032,3.865553222162319,4.339287941454435,8.448894536590212,Luxembourg,LUX,,,,Linux-6.8.0-88-generic-x86_64-with-glibc2.39,3.12.3,2.8.4,224,Intel(R) Xeon(R) Platinum 8480+,2,2 x NVIDIA H100 NVL,6.1661,49.7498,2015.3354606628418,machine,N,1.0

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6b795a9ab49d4a2d175e4e5b78f9d614a47962e436b3b0ee25c2e8b04837f67b
 size 498618976

 version https://git-lfs.github.com/spec/v1
+oid sha256:8a268ca573d8de0e408f0a0e1878f8ebcb156502847dc97c683f95101d2c46fd
 size 498618976