cedricbonhomme commited on
Commit
d6aff3a
·
verified ·
1 Parent(s): 4aa3382

End of training

Browse files
Files changed (3) hide show
  1. README.md +13 -13
  2. emissions.csv +1 -1
  3. model.safetensors +1 -1
README.md CHANGED
@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [hfl/chinese-macbert-base](https://huggingface.co/hfl/chinese-macbert-base) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.6118
22
- - Accuracy: 0.7832
23
 
24
  ## Model description
25
 
@@ -39,8 +39,8 @@ More information needed
39
 
40
  The following hyperparameters were used during training:
41
  - learning_rate: 3e-05
42
- - train_batch_size: 32
43
- - eval_batch_size: 32
44
  - seed: 42
45
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
46
  - lr_scheduler_type: linear
@@ -48,18 +48,18 @@ The following hyperparameters were used during training:
48
 
49
  ### Training results
50
 
51
- | Training Loss | Epoch | Step | Validation Loss | Accuracy |
52
- |:-------------:|:-----:|:-----:|:---------------:|:--------:|
53
- | 0.5706 | 1.0 | 3511 | 0.5875 | 0.7503 |
54
- | 0.5364 | 2.0 | 7022 | 0.5596 | 0.7702 |
55
- | 0.5483 | 3.0 | 10533 | 0.5518 | 0.7768 |
56
- | 0.4161 | 4.0 | 14044 | 0.5757 | 0.7838 |
57
- | 0.351 | 5.0 | 17555 | 0.6118 | 0.7832 |
58
 
59
 
60
  ### Framework versions
61
 
62
- - Transformers 4.57.1
63
  - Pytorch 2.9.1+cu128
64
- - Datasets 4.4.1
65
  - Tokenizers 0.22.1
 
18
 
19
  This model is a fine-tuned version of [hfl/chinese-macbert-base](https://huggingface.co/hfl/chinese-macbert-base) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.6059
22
+ - Accuracy: 0.7771
23
 
24
  ## Model description
25
 
 
39
 
40
  The following hyperparameters were used during training:
41
  - learning_rate: 3e-05
42
+ - train_batch_size: 64
43
+ - eval_batch_size: 64
44
  - seed: 42
45
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
46
  - lr_scheduler_type: linear
 
48
 
49
  ### Training results
50
 
51
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy |
52
+ |:-------------:|:-----:|:----:|:---------------:|:--------:|
53
+ | 0.5764 | 1.0 | 1772 | 0.6157 | 0.7462 |
54
+ | 0.5644 | 2.0 | 3544 | 0.5618 | 0.7663 |
55
+ | 0.4589 | 3.0 | 5316 | 0.5615 | 0.7781 |
56
+ | 0.3881 | 4.0 | 7088 | 0.5791 | 0.7823 |
57
+ | 0.3433 | 5.0 | 8860 | 0.6059 | 0.7771 |
58
 
59
 
60
  ### Framework versions
61
 
62
+ - Transformers 4.57.3
63
  - Pytorch 2.9.1+cu128
64
+ - Datasets 4.4.2
65
  - Tokenizers 0.22.1
emissions.csv CHANGED
@@ -1,2 +1,2 @@
1
  timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,on_cloud,pue
2
- 2025-11-25T08:34:42,codecarbon,46016a62-72d2-48da-9d57-e6844c7b1ce7,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,3480.7599397339945,0.14822105400392685,4.2582957908684204e-05,42.5,473.86350310892067,755.7507977485657,0.04105713564181151,0.6370560410333139,0.7299889485153672,1.4081021251904928,Luxembourg,LUX,,,,Linux-6.8.0-88-generic-x86_64-with-glibc2.39,3.12.3,2.8.4,224,Intel(R) Xeon(R) Platinum 8480+,2,2 x NVIDIA H100 NVL,6.1661,49.7498,2015.3354606628418,machine,N,1.0
 
1
  timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,on_cloud,pue
2
+ 2026-01-03T12:51:21,codecarbon,b34798ef-0b78-4cee-90c2-8e8713063c5a,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,2547.9638344610576,0.127837422724597,5.0172385100449054e-05,42.5,635.5354141556119,755.7507977485657,0.030019676328265212,0.6507438997613697,0.5336937614800426,1.2144573375696777,Luxembourg,LUX,,,,Linux-6.8.0-90-generic-x86_64-with-glibc2.39,3.12.3,2.8.4,224,Intel(R) Xeon(R) Platinum 8480+,4,4 x NVIDIA L40S,6.1661,49.7498,2015.3354606628418,machine,N,1.0
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d96b4403a787e954cfbab7f48f3ad2ab0af89a50b6affd5449df1dc3870ee341
3
  size 409103316
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:62c0265be3ac60bd62e8df16ce5b97ca922af06d2aa23ef87ce8ce1fb938222a
3
  size 409103316