CocoRoF commited on
Commit
8efa35d
·
verified ·
1 Parent(s): 2350717

kcbert_1 Done

Browse files
Files changed (1) hide show
  1. README.md +4 -9
README.md CHANGED
@@ -4,18 +4,16 @@ base_model: CocoRoF/KoModernBERT-base-mlm-ckp01
4
  tags:
5
  - generated_from_trainer
6
  model-index:
7
- - name: KoModernBERT-base-mlm-ckp01
8
  results: []
9
  ---
10
 
11
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
12
  should probably proofread and complete it, then remove this comment. -->
13
 
14
- # KoModernBERT-base-mlm-ckp01
15
 
16
  This model is a fine-tuned version of [CocoRoF/KoModernBERT-base-mlm-ckp01](https://huggingface.co/CocoRoF/KoModernBERT-base-mlm-ckp01) on an unknown dataset.
17
- It achieves the following results on the evaluation set:
18
- - Loss: 2.7985
19
 
20
  ## Model description
21
 
@@ -40,8 +38,8 @@ The following hyperparameters were used during training:
40
  - seed: 42
41
  - distributed_type: multi-GPU
42
  - num_devices: 8
43
- - gradient_accumulation_steps: 8
44
- - total_train_batch_size: 1024
45
  - total_eval_batch_size: 64
46
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
47
  - lr_scheduler_type: linear
@@ -49,9 +47,6 @@ The following hyperparameters were used during training:
49
 
50
  ### Training results
51
 
52
- | Training Loss | Epoch | Step | Validation Loss |
53
- |:-------------:|:------:|:----:|:---------------:|
54
- | 22.0701 | 0.6089 | 5000 | 2.7985 |
55
 
56
 
57
  ### Framework versions
 
4
  tags:
5
  - generated_from_trainer
6
  model-index:
7
+ - name: KoModernBERT-base-mlm-ckp02
8
  results: []
9
  ---
10
 
11
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
12
  should probably proofread and complete it, then remove this comment. -->
13
 
14
+ # KoModernBERT-base-mlm-ckp02
15
 
16
  This model is a fine-tuned version of [CocoRoF/KoModernBERT-base-mlm-ckp01](https://huggingface.co/CocoRoF/KoModernBERT-base-mlm-ckp01) on an unknown dataset.
 
 
17
 
18
  ## Model description
19
 
 
38
  - seed: 42
39
  - distributed_type: multi-GPU
40
  - num_devices: 8
41
+ - gradient_accumulation_steps: 16
42
+ - total_train_batch_size: 2048
43
  - total_eval_batch_size: 64
44
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
45
  - lr_scheduler_type: linear
 
47
 
48
  ### Training results
49
 
 
 
 
50
 
51
 
52
  ### Framework versions