End of training

Files changed (7)

README.md CHANGED

@@ -14,8 +14,6 @@ should probably proofread and complete it, then remove this comment. -->
 # ner-bert-ingredientstesting
 This model is a fine-tuned version of [bert-base-uncased](https://huggingface.co/bert-base-uncased) on an unknown dataset.
-It achieves the following results on the evaluation set:
-- Loss: 4.2773
 ## Model description
@@ -35,10 +33,10 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
-- train_batch_size: 32
-- eval_batch_size: 32
 - seed: 42
-- gradient_accumulation_steps: 4
 - total_train_batch_size: 128
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -47,14 +45,11 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| 1.0902        | 1.0   | 1    | 4.2773          |
 ### Framework versions
-- Transformers 4.35.2
-- Pytorch 2.1.0+cu121
-- Datasets 2.16.1
 - Tokenizers 0.15.0

 # ner-bert-ingredientstesting
 This model is a fine-tuned version of [bert-base-uncased](https://huggingface.co/bert-base-uncased) on an unknown dataset.
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
+- train_batch_size: 16
+- eval_batch_size: 64
 - seed: 42
+- gradient_accumulation_steps: 8
 - total_train_batch_size: 128
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 ### Training results
 ### Framework versions
+- Transformers 4.36.0
+- Pytorch 2.0.0
+- Datasets 2.1.0
 - Tokenizers 0.15.0
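
Both hyperparameter listings report a total_train_batch_size of 128, even though the per-device batch size and the accumulation steps changed. A minimal sketch of that arithmetic, assuming a single training device (the device count is not stated in the model card; `effective_batch_size` and `num_devices` are hypothetical names used only for illustration):

```python
def effective_batch_size(per_device_batch, accumulation_steps, num_devices=1):
    """Number of samples that contribute to one optimizer step."""
    return per_device_batch * accumulation_steps * num_devices


# Values taken from the removed (-) and added (+) README lines.
before = effective_batch_size(per_device_batch=32, accumulation_steps=4)
after = effective_batch_size(per_device_batch=16, accumulation_steps=8)

print(before, after)
```

Under that assumption, halving the per-device batch while doubling the accumulation steps leaves the samples per optimizer step unchanged.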

config.json CHANGED

The diff for this file is too large to render.

id2tag.json CHANGED

The diff for this file is too large to render.

logs/events.out.tfevents.1705477323.50af3b613f29.26.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:1af359cecf22ebfe94f706842c1951bec707d85ec0307384b85e804dc460989f
+size 1717218

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1694f6994992dba42455a472f699bf50f65965ba6df315bbe7f3c0ac08d286d1
-size 435820636

 version https://git-lfs.github.com/spec/v1
+oid sha256:a39fb56d4a6d853376c05edfa3d9c7f19663dde1fc7390f48f13c7ac53cae078
+size 535667604
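
The large binary files in this commit appear only as Git LFS pointers: three `key value` lines giving the spec version, the SHA-256 object id, and the size in bytes. A rough sketch of reading the two model.safetensors pointers and comparing their sizes (`parse_lfs_pointer` is a hypothetical helper, not part of any Git LFS tooling; the diff itself does not say why the size changed):

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into a dict of its key/value lines."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", separated by the first space.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    fields["size"] = int(fields["size"])  # size is given in bytes
    return fields


# Pointer contents copied from the removed (-) and added (+) sides of the diff.
OLD_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:1694f6994992dba42455a472f699bf50f65965ba6df315bbe7f3c0ac08d286d1
size 435820636
"""
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:a39fb56d4a6d853376c05edfa3d9c7f19663dde1fc7390f48f13c7ac53cae078
size 535667604
"""

old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)
size_delta = new["size"] - old["size"]

print(f"model.safetensors changed by {size_delta} bytes")
```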

trainer_state.json CHANGED

@@ -1,42 +1,30 @@
 {
   "best_metric": null,
   "best_model_checkpoint": null,
-  "epoch": 1.0,
-  "eval_steps": 1,
-  "global_step": 1,
   "is_hyper_param_search": false,
   "is_local_process_zero": true,
   "is_world_process_zero": true,
   "log_history": [
     {
       "epoch": 1.0,
-      "learning_rate": 0.0,
-      "loss": 1.0902,
-      "step": 1
-    },
-    {
-      "epoch": 1.0,
-      "eval_loss": 4.27725887298584,
-      "eval_runtime": 0.0648,
-      "eval_samples_per_second": 30.874,
-      "eval_steps_per_second": 15.437,
-      "step": 1
-    },
-    {
-      "epoch": 1.0,
-      "step": 1,
-      "total_flos": 576049352400.0,
-      "train_loss": 1.0902321338653564,
-      "train_runtime": 7.4417,
-      "train_samples_per_second": 1.075,
-      "train_steps_per_second": 0.134
     }
   ],
-  "logging_steps": 1,
-  "max_steps": 1,
   "num_train_epochs": 1,
-  "save_steps": 20,
-  "total_flos": 576049352400.0,
   "trial_name": null,
   "trial_params": null
 }

 {
   "best_metric": null,
   "best_model_checkpoint": null,
+  "epoch": 0.9996631862579993,
+  "eval_steps": 750,
+  "global_step": 742,
   "is_hyper_param_search": false,
   "is_local_process_zero": true,
   "is_world_process_zero": true,
   "log_history": [
     {
       "epoch": 1.0,
+      "step": 742,
+      "total_flos": 3.211629347340288e+16,
+      "train_loss": 5.698867345434636,
+      "train_runtime": 6114.1675,
+      "train_samples_per_second": 15.538,
+      "train_steps_per_second": 0.121
     }
   ],
+  "logging_steps": 750,
+  "max_steps": 742,
+  "num_input_tokens_seen": 0,
   "num_train_epochs": 1,
+  "save_steps": 750,
+  "total_flos": 3.211629347340288e+16,
+  "train_batch_size": 16,
   "trial_name": null,
   "trial_params": null
 }
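
The throughput figures in the new trainer_state.json can be cross-checked against each other. A small sketch, assuming the total_train_batch_size of 128 from the README applies to every optimizer step (variable names are illustrative, and the two sample estimates are not expected to match exactly):

```python
# Values copied from the added (+) lines of trainer_state.json.
global_step = 742
train_runtime = 6114.1675          # seconds
train_samples_per_second = 15.538

# Assumption: taken from the README hyperparameters, not from this file.
total_train_batch_size = 128

# Optimizer steps per second, to compare with "train_steps_per_second".
steps_per_second = global_step / train_runtime

# Two rough estimates of how many samples the run processed.
samples_from_steps = global_step * total_train_batch_size
samples_from_throughput = train_samples_per_second * train_runtime

print(round(steps_per_second, 3))
print(samples_from_steps, round(samples_from_throughput))
```

The rounded steps-per-second value agrees with the logged 0.121, and the two sample estimates land within a few dozen samples of each other, which suggests the logged numbers are internally consistent.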

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:21bafcad3f9c59888faef07c61f3e3aa60532b2d9ed02dd791605896705b4331
-size 4600

 version https://git-lfs.github.com/spec/v1
+oid sha256:d5cdffc1d5610491dce8a2d5fba106c7daa7c5df5aa173ecccf1bf30d7a688e2
+size 4283