End of training

Browse files

Files changed (5) hide show

README.md +81 -0
generation_config.json +5 -0
model.safetensors +1 -1
runs/Jul01_15-55-48_6ebbe90d6d35/events.out.tfevents.1751385369.6ebbe90d6d35.24.0 +2 -2
runs/Jul01_15-55-48_6ebbe90d6d35/events.out.tfevents.1751422080.6ebbe90d6d35.24.1 +3 -0

README.md ADDED Viewed

	@@ -0,0 +1,81 @@

+---
+base_model: yosefw/bert-mini-hebrew
+tags:
+- generated_from_trainer
+model-index:
+- name: bert-mini-hebrew-512
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# bert-mini-hebrew-512
+This model is a fine-tuned version of [yosefw/bert-mini-hebrew](https://huggingface.co/yosefw/bert-mini-hebrew) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 2.9172
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 5e-05
+- train_batch_size: 48
+- eval_batch_size: 48
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 500
+- num_epochs: 6
+- mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch  | Step  | Validation Loss |
+|:-------------:|:------:|:-----:|:---------------:|
+| 4.4373        | 0.2500 | 2827  | 3.2271          |
+| 3.4195        | 0.4999 | 5654  | 3.1138          |
+| 3.3455        | 0.7499 | 8481  | 3.0875          |
+| 3.3139        | 0.9998 | 11308 | 3.0617          |
+| 3.2911        | 1.2498 | 14135 | 3.0290          |
+| 3.2705        | 1.4997 | 16962 | 3.0172          |
+| 3.2552        | 1.7497 | 19789 | 3.0081          |
+| 3.2445        | 1.9996 | 22616 | 2.9997          |
+| 3.2304        | 2.2496 | 25443 | 2.9834          |
+| 3.2199        | 2.4996 | 28270 | 2.9713          |
+| 3.2144        | 2.7495 | 31097 | 2.9705          |
+| 3.2039        | 2.9995 | 33924 | 2.9559          |
+| 3.192         | 3.2494 | 36751 | 2.9428          |
+| 3.1859        | 3.4994 | 39578 | 2.9412          |
+| 3.1816        | 3.7493 | 42405 | 2.9410          |
+| 3.1774        | 3.9993 | 45232 | 2.9386          |
+| 3.1701        | 4.2492 | 48059 | 2.9343          |
+| 3.1684        | 4.4992 | 50886 | 2.9223          |
+| 3.1651        | 4.7492 | 53713 | 2.9201          |
+| 3.1651        | 4.9991 | 56540 | 2.9164          |
+| 3.156         | 5.2491 | 59367 | 2.9220          |
+| 3.1541        | 5.4990 | 62194 | 2.9213          |
+| 3.1548        | 5.7490 | 65021 | 2.9071          |
+| 3.1561        | 5.9989 | 67848 | 2.9159          |
+### Framework versions
+- Transformers 4.41.2
+- Pytorch 2.1.2
+- Datasets 2.19.2
+- Tokenizers 0.19.1

generation_config.json ADDED Viewed

	@@ -0,0 +1,5 @@

+{
+  "_from_model_config": true,
+  "pad_token_id": 0,
+  "transformers_version": "4.41.2"
+}

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:941fe6f6d564ba025d4bb76d135a913b79a8d08d8f7b848db55ba604c7b5bf67
 size 46334240

 version https://git-lfs.github.com/spec/v1
+oid sha256:7c8916add72dcba66a86380111dbb79ae945727542de446a6c4d5e9eeedc256c
 size 46334240

runs/Jul01_15-55-48_6ebbe90d6d35/events.out.tfevents.1751385369.6ebbe90d6d35.24.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2ac48e3d2aead65af7ae3b7bb582330e07e0445f68f023a9263bbdbcede8fb59
-size 16810

 version https://git-lfs.github.com/spec/v1
+oid sha256:cbf4454400f627c6f45464c43aa6421b3b9859a93dbf416c6f334d0fd837cfb4
+size 17170

runs/Jul01_15-55-48_6ebbe90d6d35/events.out.tfevents.1751422080.6ebbe90d6d35.24.1 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4b01768a5ed59e7f5a67651e633f3f7feef40508d74b31e298fb273fa3f24acf
+size 364