End of training

Browse files

Files changed (4) hide show

README.md +18 -18
config.json +2 -2
model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -8,20 +8,20 @@ metrics:
 - accuracy
 - f1
 model-index:
-- name: results
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# results
 This model is a fine-tuned version of [monologg/koelectra-base-v3-discriminator](https://huggingface.co/monologg/koelectra-base-v3-discriminator) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.6599
-- Accuracy: 0.6028
-- F1: 0.5504
 ## Model description
@@ -33,35 +33,35 @@ More information needed
 ## Training and evaluation data
-- [github/dev-jaemin/Korean-MBTI-Conversation-Dataset](https://github.com/dev-jaemin/Korean-MBTI-Conversation-Dataset)
-- use data in qna_cleaned.tsv, multiple_qna_cleaned.tsv
-- refine [answer, a_mbti]
-- concat both refiend data
-- Training and evaluation data split 8:2 ratio
 ## Training procedure
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 4e-05
 - train_batch_size: 32
 - eval_batch_size: 32
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - num_epochs: 5
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | Accuracy | F1     |
 |:-------------:|:------:|:----:|:---------------:|:--------:|:------:|
-| 0.6604        | 0.0669 | 200  | 0.6573          | 0.6132   | 0.5419 |
-| 0.6629        | 0.1339 | 400  | 0.6563          | 0.6200   | 0.5321 |
-| 0.6549        | 0.2008 | 600  | 0.6542          | 0.6237   | 0.5145 |
-| 0.6587        | 0.2677 | 800  | 0.6599          | 0.6028   | 0.5504 |
-| 0.6631        | 0.3347 | 1000 | 0.6599          | 0.6028   | 0.5503 |
-| 0.6584        | 0.4016 | 1200 | 0.6594          | 0.6105   | 0.3872 |
-| 0.6588        | 0.4685 | 1400 | 0.6590          | 0.6105   | 0.3872 |
 ### Framework versions

 - accuracy
 - f1
 model-index:
+- name: mbti_4axis_koelectra
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# mbti_4axis_koelectra
 This model is a fine-tuned version of [monologg/koelectra-base-v3-discriminator](https://huggingface.co/monologg/koelectra-base-v3-discriminator) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6614
+- Accuracy: 0.6027
+- F1: 0.6878
 ## Model description
 ## Training and evaluation data
+More information needed
 ## Training procedure
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 3e-05
 - train_batch_size: 32
 - eval_batch_size: 32
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_ratio: 0.1
 - num_epochs: 5
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | Accuracy | F1     |
 |:-------------:|:------:|:----:|:---------------:|:--------:|:------:|
+| 0.6586        | 0.3347 | 1000 | 0.6545          | 0.5338   | 0.5518 |
+| 0.6605        | 0.6693 | 2000 | 0.6594          | 0.5354   | 0.5261 |
+| 0.6615        | 1.0040 | 3000 | 0.6617          | 0.5166   | 0.3746 |
+| 0.663         | 1.3387 | 4000 | 0.6614          | 0.5308   | 0.6315 |
+| 0.6615        | 1.6734 | 5000 | 0.6617          | 0.5227   | 0.6865 |
+| 0.6592        | 2.0080 | 6000 | 0.6614          | 0.6027   | 0.6878 |
+| 0.6484        | 2.3427 | 7000 | 0.6488          | 0.5657   | 0.5889 |
+| 0.666         | 2.6774 | 8000 | 0.6614          | 0.4390   | 0.4533 |
+| 0.6466        | 3.0120 | 9000 | 0.6483          | 0.5402   | 0.5783 |
 ### Framework versions

config.json CHANGED Viewed

@@ -24,9 +24,9 @@
   "intermediate_size": 3072,
   "label2id": {
     "IE": 0,
     "SN": 1,
-    "TF": 2,
-    "JP": 3
   },
   "layer_norm_eps": 1e-12,
   "max_position_embeddings": 512,

   "intermediate_size": 3072,
   "label2id": {
     "IE": 0,
+    "JP": 3,
     "SN": 1,
+    "TF": 2
   },
   "layer_norm_eps": 1e-12,
   "max_position_embeddings": 512,

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f1a9175fe1b9ea66f81fdf1c4af35ba1295ce330cbb6279150b15164d20aafd0
 size 451721824

 version https://git-lfs.github.com/spec/v1
+oid sha256:ec5137cf34017c6a6b4a7763268336cfb16a75e3e001458fca6fd1eb7c153e0f
 size 451721824

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:72483dff6de8d981085c7c018954068b3ad824b171f1b7a69a9a9751bbe43c37
 size 5713

 version https://git-lfs.github.com/spec/v1
+oid sha256:a98d146f2c9d42d9028a38e621b0eb0e5b6e3c9018c9b564631c824594d3573a
 size 5713