eternalGenius commited on
Commit
440cf45
·
verified ·
1 Parent(s): 3f34b2a

eternalGenius/rubert_level2

Browse files
Files changed (3) hide show
  1. README.md +18 -16
  2. model.safetensors +1 -1
  3. training_args.bin +2 -2
README.md CHANGED
@@ -1,6 +1,6 @@
1
  ---
2
  library_name: transformers
3
- base_model: DeepPavlov/rubert-base-cased
4
  tags:
5
  - generated_from_trainer
6
  model-index:
@@ -13,12 +13,12 @@ should probably proofread and complete it, then remove this comment. -->
13
 
14
  # rubert_level2
15
 
16
- This model is a fine-tuned version of [DeepPavlov/rubert-base-cased](https://huggingface.co/DeepPavlov/rubert-base-cased) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 0.2285
19
- - F1 Micro: 0.5449
20
- - F1 Macro: 0.4966
21
- - F1 Weighted: 0.4948
22
 
23
  ## Model description
24
 
@@ -37,29 +37,31 @@ More information needed
37
  ### Training hyperparameters
38
 
39
  The following hyperparameters were used during training:
40
- - learning_rate: 2e-05
41
  - train_batch_size: 8
42
  - eval_batch_size: 8
43
  - seed: 42
44
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
45
  - lr_scheduler_type: linear
46
- - lr_scheduler_warmup_ratio: 0.1
47
- - num_epochs: 5
 
48
 
49
  ### Training results
50
 
51
  | Training Loss | Epoch | Step | Validation Loss | F1 Micro | F1 Macro | F1 Weighted |
52
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|:-----------:|
53
- | 0.3616 | 1.0 | 97 | 0.3379 | 0.0 | 0.0 | 0.0 |
54
- | 0.2963 | 2.0 | 194 | 0.2936 | 0.0759 | 0.0496 | 0.0621 |
55
- | 0.2489 | 3.0 | 291 | 0.2571 | 0.2576 | 0.2077 | 0.2098 |
56
- | 0.2138 | 4.0 | 388 | 0.2371 | 0.5032 | 0.4528 | 0.4480 |
57
- | 0.2011 | 5.0 | 485 | 0.2285 | 0.5449 | 0.4966 | 0.4948 |
 
58
 
59
 
60
  ### Framework versions
61
 
62
- - Transformers 4.57.1
63
- - Pytorch 2.8.0+cu128
64
  - Datasets 4.0.0
65
  - Tokenizers 0.22.2
 
1
  ---
2
  library_name: transformers
3
+ base_model: eternalGenius/rubert_level2
4
  tags:
5
  - generated_from_trainer
6
  model-index:
 
13
 
14
  # rubert_level2
15
 
16
+ This model is a fine-tuned version of [eternalGenius/rubert_level2](https://huggingface.co/eternalGenius/rubert_level2) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 0.1661
19
+ - F1 Micro: 0.7178
20
+ - F1 Macro: 0.7076
21
+ - F1 Weighted: 0.7118
22
 
23
  ## Model description
24
 
 
37
  ### Training hyperparameters
38
 
39
  The following hyperparameters were used during training:
40
+ - learning_rate: 5e-06
41
  - train_batch_size: 8
42
  - eval_batch_size: 8
43
  - seed: 42
44
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
45
  - lr_scheduler_type: linear
46
+ - lr_scheduler_warmup_steps: 0.1
47
+ - num_epochs: 15
48
+ - mixed_precision_training: Native AMP
49
 
50
  ### Training results
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | F1 Micro | F1 Macro | F1 Weighted |
53
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|:-----------:|
54
+ | 0.1082 | 1.0 | 97 | 0.1827 | 0.6862 | 0.6628 | 0.6653 |
55
+ | 0.0939 | 2.0 | 194 | 0.1743 | 0.7165 | 0.7000 | 0.7067 |
56
+ | 0.0861 | 3.0 | 291 | 0.1737 | 0.7198 | 0.7049 | 0.7055 |
57
+ | 0.0796 | 4.0 | 388 | 0.1735 | 0.7160 | 0.7074 | 0.7095 |
58
+ | 0.0771 | 5.0 | 485 | 0.1699 | 0.7089 | 0.6921 | 0.6923 |
59
+ | 0.0668 | 6.0 | 582 | 0.1661 | 0.7178 | 0.7076 | 0.7118 |
60
 
61
 
62
  ### Framework versions
63
 
64
+ - Transformers 5.0.0
65
+ - Pytorch 2.10.0+cu128
66
  - Datasets 4.0.0
67
  - Tokenizers 0.22.2
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c5803f96246c77e78c8d3fcddff4eedea298c07bef31a4b5fe925483fb9b5c79
3
  size 711471116
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a30eaf03664ae21673c57cc697688eec0fbfafc93c991ea2503df21432bee54b
3
  size 711471116
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f69b2f3cc112be17d377924145a397414f4822261fc8d4e4115ca48527892265
3
- size 5841
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9771be977be3b8504664036ddfa90cf77e365fe48fa1f596388626f3a57d2cfc
3
+ size 5201