Hartunka/tiny_bert_rand_20_v1_stsb

+---
+library_name: transformers
+base_model: Hartunka/tiny_bert_rand_20_v1
+tags:
+- generated_from_trainer
+metrics:
+- spearmanr
+model-index:
+- name: tiny_bert_rand_20_v1_stsb
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# tiny_bert_rand_20_v1_stsb
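+This model is a fine-tuned version of [Hartunka/tiny_bert_rand_20_v1](https://huggingface.co/Hartunka/tiny_bert_rand_20_v1) on a dataset that the Trainer did not record (the `_stsb` suffix and the Pearson/Spearman metrics suggest GLUE STS-B, but this card does not confirm it).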
+It achieves the following results on the evaluation set:
+- Loss: 2.5196
+- Pearson: 0.2860
+- Spearmanr: 0.2863
+- Combined Score: 0.2861
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 5e-05
+- train_batch_size: 256
+- eval_batch_size: 256
+- seed: 10
+- optimizer: adamw_torch with betas=(0.9, 0.999) and epsilon=1e-08; no additional optimizer arguments
+- lr_scheduler_type: linear
+- num_epochs: 50
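The hyperparameters above configure a linear schedule over 50 epochs, while the results table logs only six epochs (138 steps at 23 steps per epoch); the card does not say why the run ended there. The sketch below estimates the learning rate at that point. It assumes no warmup (none is listed) and that the schedule was planned over all 50 epochs; `linear_lr` is a hypothetical helper written for illustration, not a Transformers function.

```python
def linear_lr(step: int, base_lr: float, total_steps: int, warmup_steps: int = 0) -> float:
    """Learning rate under linear warmup followed by linear decay to zero."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


base_lr = 5e-05        # learning_rate from the card
steps_per_epoch = 23   # Step column of the results table, epoch 1
num_epochs = 50        # num_epochs from the card

# Assumption: the scheduler was built for the full 50 epochs.
total_steps = steps_per_epoch * num_epochs

# Learning rate after the last logged step (epoch 6, step 138).
lr_after_epoch_6 = linear_lr(138, base_lr, total_steps)
print(f"lr after step 138: {lr_after_epoch_6:.3e}")
```

Under these assumptions the learning rate had decayed by only about 12% when logging stopped, so most of the planned schedule was never used.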
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Pearson | Spearmanr | Combined Score |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:---------:|:--------------:|
+| 3.3912        | 1.0   | 23   | 2.2253          | 0.1250  | 0.1099    | 0.1174         |
+| 2.0454        | 2.0   | 46   | 2.6572          | 0.1081  | 0.0913    | 0.0997         |
+| 1.8227        | 3.0   | 69   | 2.3758          | 0.1796  | 0.1638    | 0.1717         |
+| 1.6031        | 4.0   | 92   | 2.4632          | 0.2419  | 0.2344    | 0.2381         |
+| 1.2975        | 5.0   | 115  | 2.4302          | 0.2774  | 0.2740    | 0.2757         |
+| 1.028         | 6.0   | 138  | 2.5196          | 0.2860  | 0.2863    | 0.2861         |
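The table can be checked for internal consistency with a few lines of Python. The sketch below assumes the Combined Score is the plain mean of Pearson and Spearman (as the `run_glue.py` example script in Transformers computes it; the card does not name the script that was used) and compares the epoch with the lowest validation loss against the epoch with the highest combined score. The values are the rounded figures copied from the table.

```python
# (epoch, validation_loss, pearson, spearmanr, combined_score) copied from the table.
rows = [
    (1, 2.2253, 0.1250, 0.1099, 0.1174),
    (2, 2.6572, 0.1081, 0.0913, 0.0997),
    (3, 2.3758, 0.1796, 0.1638, 0.1717),
    (4, 2.4632, 0.2419, 0.2344, 0.2381),
    (5, 2.4302, 0.2774, 0.2740, 0.2757),
    (6, 2.5196, 0.2860, 0.2863, 0.2861),
]

# Assumption: combined score = mean(pearson, spearmanr). Because the inputs are
# rounded to four decimals, allow a small gap instead of exact equality.
max_gap = max(abs((p + s) / 2 - c) for _, _, p, s, c in rows)

# Epoch selected by each criterion.
best_by_loss = min(rows, key=lambda r: r[1])[0]
best_by_score = max(rows, key=lambda r: r[4])[0]

print(f"largest gap to mean(pearson, spearmanr): {max_gap:.5f}")
print(f"lowest validation loss at epoch {best_by_loss}")
print(f"highest combined score at epoch {best_by_score}")
```

On these numbers the two criteria disagree: validation loss is lowest at epoch 1, while the correlations keep improving through epoch 6, which is the checkpoint whose metrics the card reports.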
+### Framework versions
+- Transformers 4.50.2
+- Pytorch 2.2.1+cu121
+- Datasets 2.18.0
+- Tokenizers 0.21.1

logs/events.out.tfevents.1744654737.s_005_m.2701049.102 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d6d7b6d10cab14df3bbb344a13f6448735cdd42ce8ce8cf31d3fdb90e8aa1a59
-size 8168
+oid sha256:99d8b083c9c694e8a63b241dc44ca2c7a6a016c132604c71c67200dc3c66a18c
+size 9166

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0b89cbbd313e0f9fe47f27f27b703b497399e157a96e825c97b9362483b9d906
+oid sha256:0fe396a863467c42fdb3f1d71703e1a0bca9543d538840eb2cd05d1f98375ad1
 size 131854692
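
Both changed files are tracked with Git LFS, so the diffs show only pointer files: a spec version line, the SHA-256 of the content, and its size in bytes. As a rough illustration of how those three lines relate to the underlying file, the sketch below builds a pointer from raw bytes. `lfs_pointer` is a hypothetical helper written for this note, not part of the Git LFS tooling, and the payload is a made-up example.

```python
import hashlib


def lfs_pointer(content: bytes) -> str:
    """Build Git LFS pointer text (version, oid, size) for the given bytes."""
    digest = hashlib.sha256(content).hexdigest()
    return (
        "version https://git-lfs.github.com/spec/v1\n"
        f"oid sha256:{digest}\n"
        f"size {len(content)}\n"
    )


# Hypothetical payload; a real pointer would hash the full file contents.
pointer = lfs_pointer(b"example payload")
print(pointer, end="")
```

Read this way, the `model.safetensors` change (new oid, identical size of 131854692 bytes) is consistent with updated weights in an unchanged tensor layout, while the event log grew from 8168 to 9166 bytes.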