End of training

Files changed (4)

README.md +77 -0
config.json +15 -0
model.safetensors +3 -0
training_args.bin +3 -0

README.md ADDED

	@@ -0,0 +1,77 @@

+---
+library_name: transformers
+tags:
+- generated_from_trainer
+model-index:
+- name: slac-single-head
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# slac-single-head
+This model is a fine-tuned version of an unspecified base model (the auto-generated link is empty) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 1.5153
+- F1 Macro: 0.3202
+- Precision Macro: 0.2175
+- Recall Macro: 0.6571
+- F1 Micro: 0.3451
+- Precision Micro: 0.2281
+- Recall Micro: 0.7091
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 64
+- eval_batch_size: 64
+- seed: 42
+- optimizer: adamw_torch with betas=(0.9, 0.999) and epsilon=1e-08; no additional optimizer arguments
+- lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 1
+- num_epochs: 15
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | F1 Macro | Precision Macro | Recall Macro | F1 Micro | Precision Micro | Recall Micro |
+|:-------------:|:-----:|:----:|:---------------:|:--------:|:---------------:|:------------:|:--------:|:---------------:|:------------:|
+| 1.3757        | 1.0   | 2    | 1.6881          | 0.1334   | 0.1078          | 0.2213       | 0.1699   | 0.1327          | 0.2364       |
+| 1.0487        | 2.0   | 4    | 1.6485          | 0.2149   | 0.2115          | 0.3791       | 0.2418   | 0.1732          | 0.4000       |
+| 1.0743        | 3.0   | 6    | 1.6470          | 0.2522   | 0.2857          | 0.3791       | 0.2895   | 0.2268          | 0.4000       |
+| 1.0245        | 4.0   | 8    | 1.6460          | 0.2222   | 0.1721          | 0.3475       | 0.2877   | 0.2308          | 0.3818       |
+| 1.1128        | 5.0   | 10   | 1.6505          | 0.2230   | 0.1862          | 0.3075       | 0.2812   | 0.2466          | 0.3273       |
+| 1.1014        | 6.0   | 12   | 1.6555          | 0.2402   | 0.2075          | 0.3075       | 0.2927   | 0.2647          | 0.3273       |
+| 1.0573        | 7.0   | 14   | 1.6526          | 0.2448   | 0.2038          | 0.3267       | 0.2946   | 0.2568          | 0.3455       |
+| 1.1008        | 8.0   | 16   | 1.6273          | 0.2472   | 0.1970          | 0.3579       | 0.2993   | 0.2391          | 0.4000       |
+| 1.0884        | 9.0   | 18   | 1.5901          | 0.2887   | 0.2171          | 0.4627       | 0.3256   | 0.2393          | 0.5091       |
+| 0.9172        | 10.0  | 20   | 1.5491          | 0.3261   | 0.2319          | 0.5814       | 0.3518   | 0.2431          | 0.6364       |
+| 1.0732        | 11.0  | 22   | 1.5236          | 0.3359   | 0.2291          | 0.6799       | 0.3587   | 0.2381          | 0.7273       |
+| 1.2515        | 12.0  | 24   | 1.5132          | 0.3258   | 0.2200          | 0.6799       | 0.3463   | 0.2273          | 0.7273       |
+| 0.9967        | 13.0  | 26   | 1.5129          | 0.3258   | 0.2200          | 0.6799       | 0.3463   | 0.2273          | 0.7273       |
+| 1.0090        | 14.0  | 28   | 1.5137          | 0.3182   | 0.2159          | 0.6571       | 0.3421   | 0.2254          | 0.7091       |
+| 0.9672        | 15.0  | 30   | 1.5153          | 0.3202   | 0.2175          | 0.6571       | 0.3451   | 0.2281          | 0.7091       |
+### Framework versions
+- Transformers 4.47.0
+- Pytorch 2.5.1+cu121
+- Datasets 3.3.1
+- Tokenizers 0.21.0
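
The micro and macro metrics in the model card above can be sanity-checked with a little arithmetic. The sketch below assumes the usual definitions (micro F1 is the harmonic mean of micro precision and micro recall, while macro F1 is the unweighted mean of per-class F1 scores); the README does not say how the metrics were computed, and the function name is illustrative only.

```python
def f1_from_precision_recall(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Final-epoch (step 30) values copied from the model card.
micro_f1 = f1_from_precision_recall(0.2281, 0.7091)

# Harmonic mean of the *macro-averaged* precision and recall. Under the
# assumed definitions this is NOT expected to equal the reported macro F1.
macro_from_averages = f1_from_precision_recall(0.2175, 0.6571)

print(f"micro F1 recomputed:     {micro_f1:.4f} (reported 0.3451)")
print(f"harmonic mean of macros: {macro_from_averages:.4f} (reported F1 Macro 0.3202)")
```

The recomputed micro F1 agrees with the reported value up to rounding of the inputs. The macro figure does not, which is what one would expect if F1 Macro is averaged per class rather than derived from the averaged precision and recall.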

config.json ADDED

	@@ -0,0 +1,15 @@

+{
+  "architectures": [
+    "BERTModel"
+  ],
+  "model_type": "bert_model",
+  "num_classes": 4,
+  "pos_weight": [
+    24.0,
+    15.666666666666666,
+    4.555555555555555,
+    6.142857142857143
+  ],
+  "torch_dtype": "float32",
+  "transformers_version": "4.47.0"
+}
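
The `pos_weight` list in this config has one entry per class (`num_classes` is 4) and resembles the `pos_weight` argument of `torch.nn.BCEWithLogitsLoss`. Whether the custom `BERTModel` class actually uses it that way is an assumption; the commit does not include the model code. Under that assumption, a dependency-free sketch of the weighted loss looks like this (the function names are hypothetical):

```python
import math

# Values copied from config.json.
POS_WEIGHT = [24.0, 15.666666666666666, 4.555555555555555, 6.142857142857143]


def softplus(z: float) -> float:
    """Numerically stable log(1 + exp(z))."""
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))


def weighted_bce_with_logits(logits, targets, pos_weight):
    """Mean over labels of -[w * y * log(sigmoid(x)) + (1 - y) * log(1 - sigmoid(x))]."""
    total = 0.0
    for x, y, w in zip(logits, targets, pos_weight):
        log_sigmoid = -softplus(-x)            # log(sigmoid(x))
        log_one_minus_sigmoid = -softplus(x)   # log(1 - sigmoid(x))
        total += -(w * y * log_sigmoid + (1.0 - y) * log_one_minus_sigmoid)
    return total / len(logits)


# With all logits at 0, every label contributes log(2) when unweighted.
all_negative = weighted_bce_with_logits([0.0] * 4, [0, 0, 0, 0], POS_WEIGHT)

# A positive for class 0 at logit 0 is penalised 24 times as heavily.
miss_positive = weighted_bce_with_logits([0.0] * 4, [1, 0, 0, 0], POS_WEIGHT)

print(all_negative, miss_positive)
```

The PyTorch documentation suggests setting `pos_weight` to the ratio of negative to positive examples per class, so values this large would point to rare positives. Weighting positives heavily also tends to trade precision for recall, which would be consistent with the high-recall, low-precision numbers in the model card, though the commit itself does not confirm the cause.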

model.safetensors ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:d8c6b5a354e4768b82ed9a8917121d01d9207a04b17e4226ae76eafb9f1e7e88
+size 437964888
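
The three lines above are a Git LFS pointer, not the weights themselves: `oid` is the SHA-256 of the real file and `size` is its length in bytes. A minimal sketch of parsing the pointer and turning the size into a rough parameter count follows. It assumes every tensor is stored as 4-byte `float32` (as `torch_dtype` in config.json suggests), and `parse_lfs_pointer` is a hypothetical helper, not part of any library.

```python
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:d8c6b5a354e4768b82ed9a8917121d01d9207a04b17e4226ae76eafb9f1e7e88
size 437964888
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    fields["size"] = int(fields["size"])
    return fields


pointer = parse_lfs_pointer(POINTER)

# Upper bound only: the safetensors file also contains a small JSON header,
# so the true parameter count is slightly lower than size / 4.
approx_params = pointer["size"] // 4

print(pointer["oid"])
print(f"~{approx_params / 1e6:.1f}M float32 parameters (upper bound)")
```

Roughly 109 million parameters is in the range of a BERT-base-sized encoder, which fits the `BERTModel` architecture name but does not identify the base checkpoint.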

training_args.bin ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4014918409db83589f1bbd62c7b44833686b2ff8b3bad5a08246ddc7f29f031a
+size 5368
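
`training_args.bin` is a binary file, so the only human-readable record of the run's settings in this commit is the hyperparameter list in README.md. As an illustration of what those settings imply, here is a sketch of a linear schedule with warmup using the listed values (`learning_rate` 2e-05, `lr_scheduler_warmup_steps` 1) and the 30 total steps shown in the results table. The function is a hypothetical stand-in written from the standard linear-warmup-then-decay rule, not code taken from the run.

```python
def linear_schedule_lr(step: int, base_lr: float = 2e-5,
                       warmup_steps: int = 1, total_steps: int = 30) -> float:
    """Linear warmup from 0 to base_lr, then linear decay to 0 at total_steps."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Learning rate at every optimizer step, 0 through 30 inclusive.
lrs = [linear_schedule_lr(step) for step in range(31)]

for step in (0, 1, 15, 30):
    print(step, lrs[step])
```

With a single warmup step the schedule reaches the peak rate almost immediately and spends the rest of the 30 steps decaying, so the later epochs in the results table were trained at a small fraction of 2e-05.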