Hartunka
/

tiny_bert_rand_100_v2

@@ -1,24 +1,11 @@
 ---
-library_name: transformers
 tags:
 - generated_from_trainer
-datasets:
-- Hartunka/processed_wikitext-103-raw-v1-rand-100
 metrics:
 - accuracy
 model-index:
 - name: tiny_bert_rand_100_v2
-  results:
-  - task:
-      name: Masked Language Modeling
-      type: fill-mask
-    dataset:
-      name: Hartunka/processed_wikitext-103-raw-v1-rand-100
-      type: Hartunka/processed_wikitext-103-raw-v1-rand-100
-    metrics:
-    - name: Accuracy
-      type: accuracy
-      value: 0.15262473865626944
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -26,10 +13,10 @@ should probably proofread and complete it, then remove this comment. -->
 # tiny_bert_rand_100_v2
-This model is a fine-tuned version of [](https://huggingface.co/) on the Hartunka/processed_wikitext-103-raw-v1-rand-100 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 10.8934
-- Accuracy: 0.1526
 ## Model description
@@ -52,7 +39,7 @@ The following hyperparameters were used during training:
 - train_batch_size: 96
 - eval_batch_size: 96
 - seed: 10
-- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 10000
 - num_epochs: 25
@@ -61,16 +48,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch   | Step  | Validation Loss | Accuracy |
 |:-------------:|:-------:|:-----:|:---------------:|:--------:|
-| 10.6718       | 4.1982  | 10000 | 10.8784         | 0.1502   |
-| 10.0251       | 8.3963  | 20000 | 11.1457         | 0.1529   |
-| 9.1754        | 12.5945 | 30000 | 11.7428         | 0.1530   |
-| 8.1981        | 16.7926 | 40000 | 12.5714         | 0.1505   |
-| 7.4533        | 20.9908 | 50000 | 13.6521         | 0.1509   |
 ### Framework versions
-- Transformers 4.50.2
-- Pytorch 2.2.1+cu121
-- Datasets 2.18.0
-- Tokenizers 0.21.1

 ---
 tags:
 - generated_from_trainer
 metrics:
 - accuracy
 model-index:
 - name: tiny_bert_rand_100_v2
+  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # tiny_bert_rand_100_v2
+This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 13.4746
+- Accuracy: 0.1518
 ## Model description
 - train_batch_size: 96
 - eval_batch_size: 96
 - seed: 10
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 10000
 - num_epochs: 25
 | Training Loss | Epoch   | Step  | Validation Loss | Accuracy |
 |:-------------:|:-------:|:-----:|:---------------:|:--------:|
+| 10.6784       | 4.1982  | 10000 | 10.8779         | 0.1501   |
+| 10.0232       | 8.3963  | 20000 | 11.1558         | 0.1530   |
+| 9.1752        | 12.5945 | 30000 | 11.7162         | 0.1537   |
+| 8.2175        | 16.7926 | 40000 | 12.6584         | 0.1513   |
+| 7.4989        | 20.9908 | 50000 | 13.4746         | 0.1518   |
 ### Framework versions
+- Transformers 4.40.0
+- Pytorch 2.6.0+cu124
+- Datasets 3.5.0
+- Tokenizers 0.19.1

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d8d43b9208d9a918ae0622d49d096b5afe581f70b2180f15430c09a200b939bd
 size 133235168

 version https://git-lfs.github.com/spec/v1
+oid sha256:280b72a2ad91b56df6e454884916a9a86cfa211dabcaf8bab104f628ea78d8d4
 size 133235168