Laseung/roberta-base-klue-ynat-classification

Files changed (6) hide show

README.md CHANGED Viewed

@@ -17,8 +17,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [klue/roberta-base](https://huggingface.co/klue/roberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4450
-- Accuracy: 0.872
 ## Model description
@@ -41,7 +41,7 @@ The following hyperparameters were used during training:
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
-- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - num_epochs: 1
@@ -49,12 +49,12 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 0.551         | 1.0   | 1250 | 0.5041          | 0.854    |
 ### Framework versions
-- Transformers 4.50.0
 - Pytorch 2.8.0+cu126
-- Datasets 3.5.0
-- Tokenizers 0.21.4

 This model is a fine-tuned version of [klue/roberta-base](https://huggingface.co/klue/roberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4444
+- Accuracy: 0.864
 ## Model description
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
+- optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - num_epochs: 1
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
+| 0.5365        | 1.0   | 1250 | 0.5129          | 0.848    |
 ### Framework versions
+- Transformers 4.57.1
 - Pytorch 2.8.0+cu126
+- Datasets 4.0.0
+- Tokenizers 0.22.1

config.json CHANGED Viewed

@@ -5,6 +5,7 @@
   "attention_probs_dropout_prob": 0.1,
   "bos_token_id": 0,
   "classifier_dropout": null,
   "eos_token_id": 2,
   "gradient_checkpointing": false,
   "hidden_act": "gelu",
@@ -39,8 +40,7 @@
   "position_embedding_type": "absolute",
   "problem_type": "single_label_classification",
   "tokenizer_class": "BertTokenizer",
-  "torch_dtype": "float32",
-  "transformers_version": "4.50.0",
   "type_vocab_size": 1,
   "use_cache": true,
   "vocab_size": 32000

   "attention_probs_dropout_prob": 0.1,
   "bos_token_id": 0,
   "classifier_dropout": null,
+  "dtype": "float32",
   "eos_token_id": 2,
   "gradient_checkpointing": false,
   "hidden_act": "gelu",
   "position_embedding_type": "absolute",
   "problem_type": "single_label_classification",
   "tokenizer_class": "BertTokenizer",
+  "transformers_version": "4.57.1",
   "type_vocab_size": 1,
   "use_cache": true,
   "vocab_size": 32000

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:120bebf78756b3842c8c1e41fca12e97079d6824d7e2d2fbbdbbc2d070733efc
 size 442518124

 version https://git-lfs.github.com/spec/v1
+oid sha256:9ad81af3fbfbdd1a36b05cd3a4896e7a81e2abba5cb5fd348dbd8593bd96bb65
 size 442518124

runs/Nov12_03-30-39_4594ddad2c7a/events.out.tfevents.1762918334.4594ddad2c7a.753.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:afc0d759a03a426c1867ae3ad0176bfdaa088c63160b53958f28f2f01fb821a6
+size 6604

runs/Nov12_03-30-39_4594ddad2c7a/events.out.tfevents.1762919370.4594ddad2c7a.753.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:a9269ed73cfa38714dc33f81a6c5aa7f20d885567722929c3ff998ec50cb2b69
+size 411

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:51b3fe2b1714cd7ea13c71a2da84a283d82d66950d47bafd24cfa77c1fe7f39e
-size 5777

 version https://git-lfs.github.com/spec/v1
+oid sha256:dfd363f5d85c91a585dbdc544176ba2f759bfc370ae474bfb76f8fc172106d0d
+size 5841