Model save

- README.md +15 -12
- model.safetensors +1 -1
- tokenizer.json +2 -2
- tokenizer_config.json +3 -44
README.md
CHANGED
@@ -19,9 +19,9 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [xlm-roberta-base](https://huggingface.co/xlm-roberta-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.
-- Accuracy: 0.
-- F1: 0.
+- Loss: 0.5161
+- Accuracy: 0.8938
+- F1: 0.8910
 
 ## Model description
 
@@ -41,25 +41,28 @@ More information needed
 
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size:
-- eval_batch_size:
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
-- optimizer: Use OptimizerNames.
+- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs:
+- num_epochs: 5
+- mixed_precision_training: Native AMP
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|
-| 0.
-| 0.
-| 0.
+| 0.6300 | 1.0 | 80 | 0.3917 | 0.8516 | 0.8508 |
+| 0.3762 | 2.0 | 160 | 0.4943 | 0.8132 | 0.7866 |
+| 0.3336 | 3.0 | 240 | 0.4476 | 0.8773 | 0.8743 |
+| 0.2274 | 4.0 | 320 | 0.3879 | 0.9048 | 0.9041 |
+| 0.2204 | 5.0 | 400 | 0.5161 | 0.8938 | 0.8910 |
 
 
 ### Framework versions
 
-- Transformers
-- Pytorch 2.
+- Transformers 5.0.0
+- Pytorch 2.7.1+cu118
 - Datasets 4.5.0
 - Tokenizers 0.22.2
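The hyperparameters and metrics above come from an auto-generated model card, and the training script itself is not part of this commit. Below is a minimal, hedged sketch of how a comparable run could be set up with the Hugging Face Trainer, assuming recent Transformers argument names. The dataset, its column names, and the number of labels are placeholders, since the card lists the dataset as None, and the F1 averaging mode is a guess.

```python
# Hedged sketch of a fine-tuning run matching the listed hyperparameters.
# Placeholders/assumptions: dataset name, "text"/"label" columns, train/validation
# splits, num_labels=2, and weighted F1 averaging.
import numpy as np
import evaluate
from datasets import load_dataset
from transformers import (
    AutoTokenizer,
    AutoModelForSequenceClassification,
    TrainingArguments,
    Trainer,
)

dataset = load_dataset("your_dataset_here")  # placeholder: card lists "None"
tokenizer = AutoTokenizer.from_pretrained("xlm-roberta-base")
model = AutoModelForSequenceClassification.from_pretrained(
    "xlm-roberta-base", num_labels=2  # placeholder label count
)

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True)  # assumes a "text" column

tokenized = dataset.map(tokenize, batched=True)

accuracy = evaluate.load("accuracy")
f1 = evaluate.load("f1")

def compute_metrics(eval_pred):
    logits, labels = eval_pred
    preds = np.argmax(logits, axis=-1)
    return {
        "accuracy": accuracy.compute(predictions=preds, references=labels)["accuracy"],
        "f1": f1.compute(predictions=preds, references=labels, average="weighted")["f1"],
    }

args = TrainingArguments(
    output_dir="xlm-roberta-base-finetuned",
    learning_rate=2e-5,
    per_device_train_batch_size=16,
    per_device_eval_batch_size=16,
    num_train_epochs=5,
    lr_scheduler_type="linear",
    seed=42,
    fp16=True,                 # "Native AMP" mixed precision
    eval_strategy="epoch",
    logging_strategy="epoch",
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=tokenized["train"],
    eval_dataset=tokenized["validation"],
    processing_class=tokenizer,   # default collator then pads dynamically
    compute_metrics=compute_metrics,
)
trainer.train()
```

The 80 optimizer steps per epoch in the results table are consistent with roughly 1,280 training examples at this batch size, assuming a single device and no gradient accumulation.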
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:4459846fc3685bd89c700f2f7979ed515544f8c966e874d7f1cf40aa21cf1215
 size 1112204984
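model.safetensors and tokenizer.json are stored through Git LFS, so the diff only touches the pointer files: the sha256 oid (and, for tokenizer.json below, the size) changes while the pointer format stays the same. If needed, a downloaded copy can be checked against the recorded oid; the local file path below is a placeholder.

```python
# Sketch: verify a downloaded artifact against the sha256 oid recorded in the
# Git LFS pointer shown above. The local file path is a placeholder.
import hashlib

EXPECTED_OID = "4459846fc3685bd89c700f2f7979ed515544f8c966e874d7f1cf40aa21cf1215"
path = "model.safetensors"  # path to the locally downloaded file

sha256 = hashlib.sha256()
with open(path, "rb") as f:
    for chunk in iter(lambda: f.read(1 << 20), b""):  # read in 1 MiB chunks
        sha256.update(chunk)

print(sha256.hexdigest() == EXPECTED_OID)  # True if the download is intact
```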
tokenizer.json
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:e5b633524ba90477daaba16ec27580a08a2856ae0ee8c33d9f5f9358378d3b35
+size 16781751
tokenizer_config.json
CHANGED
@@ -1,51 +1,10 @@
 {
-  "added_tokens_decoder": {
-    "0": {
-      "content": "<s>",
-      "lstrip": false,
-      "normalized": false,
-      "rstrip": false,
-      "single_word": false,
-      "special": true
-    },
-    "1": {
-      "content": "<pad>",
-      "lstrip": false,
-      "normalized": false,
-      "rstrip": false,
-      "single_word": false,
-      "special": true
-    },
-    "2": {
-      "content": "</s>",
-      "lstrip": false,
-      "normalized": false,
-      "rstrip": false,
-      "single_word": false,
-      "special": true
-    },
-    "3": {
-      "content": "<unk>",
-      "lstrip": false,
-      "normalized": false,
-      "rstrip": false,
-      "single_word": false,
-      "special": true
-    },
-    "250001": {
-      "content": "<mask>",
-      "lstrip": true,
-      "normalized": false,
-      "rstrip": false,
-      "single_word": false,
-      "special": true
-    }
-  },
+  "add_prefix_space": true,
+  "backend": "tokenizers",
   "bos_token": "<s>",
-  "clean_up_tokenization_spaces": false,
   "cls_token": "<s>",
   "eos_token": "</s>",
-  "
+  "is_local": false,
   "mask_token": "<mask>",
   "model_max_length": 512,
   "pad_token": "<pad>",
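The trimmed tokenizer_config.json keeps only top-level fields: the removed added_tokens_decoder entries are carried by tokenizer.json itself, and the new add_prefix_space / backend / is_local keys appear to reflect how newer Transformers releases serialize the config. Below is a short sketch of loading the pushed checkpoint for inference; the repository id, example sentence, and printed labels are placeholders, since the card does not name the dataset or its classes.

```python
# Sketch: load the fine-tuned checkpoint for inference. The repository id and
# example sentence are placeholders; label names depend on the unlisted dataset.
from transformers import pipeline

clf = pipeline(
    "text-classification",
    model="your-username/xlm-roberta-base-finetuned",  # placeholder repo id
)

print(clf.tokenizer.model_max_length)   # 512, as set in tokenizer_config.json
print(clf("Example sentence to classify."))
# e.g. [{'label': 'LABEL_1', 'score': 0.97}]  -- labels depend on the dataset
```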