Training in progress, epoch 3, checkpoint

Files changed (7)

last-checkpoint/README.md +34 -6
last-checkpoint/model.safetensors +1 -1
last-checkpoint/optimizer.pt +1 -1
last-checkpoint/rng_state.pth +1 -1
last-checkpoint/scaler.pt +1 -1
last-checkpoint/scheduler.pt +1 -1
last-checkpoint/trainer_state.json +19 -3

last-checkpoint/README.md CHANGED

@@ -38,6 +38,21 @@ widget:
   - kids game
 pipeline_tag: sentence-similarity
 library_name: sentence-transformers
 ---
 # SentenceTransformer based on sentence-transformers/all-MiniLM-L6-v2
@@ -101,9 +116,9 @@ print(embeddings.shape)
 # Get the similarity scores for the embeddings
 similarities = model.similarity(embeddings, embeddings)
 print(similarities)
-# tensor([[1.0000, 0.7366, 0.4534],
-#         [0.7366, 1.0000, 0.4396],
-#         [0.4534, 0.4396, 1.0000]])
 ```
 <!--
@@ -130,6 +145,18 @@ You can finetune this model on your own dataset.
 *List how the model may foreseeably be misused and address what users ought not to do with the model.*
 -->
 <!--
 ## Bias, Risks and Limitations
@@ -337,9 +364,10 @@ You can finetune this model on your own dataset.
 </details>
 ### Training Logs
-| Epoch  | Step | Training Loss |
-|:------:|:----:|:-------------:|
-| 0.0004 | 1    | 5.3655        |
 ### Framework Versions

   - kids game
 pipeline_tag: sentence-similarity
 library_name: sentence-transformers
+metrics:
+- cosine_accuracy
+model-index:
+- name: SentenceTransformer based on sentence-transformers/all-MiniLM-L6-v2
+  results:
+  - task:
+      type: triplet
+      name: Triplet
+    dataset:
+      name: Unknown
+      type: unknown
+    metrics:
+    - type: cosine_accuracy
+      value: 0.9412940740585327
+      name: Cosine Accuracy
 ---
 # SentenceTransformer based on sentence-transformers/all-MiniLM-L6-v2
 # Get the similarity scores for the embeddings
 similarities = model.similarity(embeddings, embeddings)
 print(similarities)
+# tensor([[1.0000, 0.7198, 0.3823],
+#         [0.7198, 1.0000, 0.3737],
+#         [0.3823, 0.3737, 1.0000]])
 ```
 <!--
 *List how the model may foreseeably be misused and address what users ought not to do with the model.*
 -->
+## Evaluation
+### Metrics
+#### Triplet
+* Evaluated with [<code>TripletEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.TripletEvaluator)
+| Metric              | Value      |
+|:--------------------|:-----------|
+| **cosine_accuracy** | **0.9413** |
 <!--
 ## Bias, Risks and Limitations
 </details>
 ### Training Logs
+| Epoch  | Step | Training Loss | Validation Loss | cosine_accuracy |
+|:------:|:----:|:-------------:|:---------------:|:---------------:|
+| 0.0004 | 1    | 5.3655        | -               | -               |
+| 2.1949 | 5000 | 2.1423        | 0.7694          | 0.9413          |
 ### Framework Versions
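
The `cosine_accuracy` value added to the README above is reported by `TripletEvaluator`. As a rough illustration of what that metric measures, the sketch below counts the fraction of triplets whose anchor is closer (by cosine similarity) to the positive than to the negative. The helper names and the toy vectors are hypothetical and are not taken from this repository; the library's own implementation may differ in detail.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def triplet_cosine_accuracy(anchors, positives, negatives):
    """Fraction of triplets where sim(anchor, positive) > sim(anchor, negative)."""
    correct = 0
    for a, p, n in zip(anchors, positives, negatives):
        if cosine_similarity(a, p) > cosine_similarity(a, n):
            correct += 1
    return correct / len(anchors)


# Toy embeddings (hypothetical): the first triplet is ranked correctly,
# the second is not, so the accuracy is 1 out of 2.
anchors = [[1.0, 0.0], [0.0, 1.0]]
positives = [[0.9, 0.1], [1.0, 0.0]]
negatives = [[0.0, 1.0], [0.1, 0.9]]

accuracy = triplet_cosine_accuracy(anchors, positives, negatives)
print(accuracy)
```

In practice the vectors would come from `model.encode(...)` on the anchor, positive, and negative sentences of the evaluation set.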

last-checkpoint/model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:de1414049a746b2e56574aabef3303b914e7b9a0f4c442d93140d5bdb4a130cc
 size 90864192

 version https://git-lfs.github.com/spec/v1
+oid sha256:9e46149fd09a9867b9acad65acdb71570057411c6a87b5b28cc4922225edf94c
 size 90864192
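
The binary files in this commit are stored as Git LFS pointers: three text lines giving the spec version, the SHA-256 `oid` of the real file, and its size in bytes. Here the `size` is unchanged while the `oid` differs, which is what one would expect when weights are updated in place. A minimal sketch for parsing such a pointer and checking a payload against it follows; the function names are hypothetical, and the demo payload is a stand-in, not the actual checkpoint.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data, pointer):
    """True if the bytes have the size and SHA-256 digest named by the pointer."""
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


# The new pointer for model.safetensors, copied from the diff above.
pointer_text = """version https://git-lfs.github.com/spec/v1
oid sha256:9e46149fd09a9867b9acad65acdb71570057411c6a87b5b28cc4922225edf94c
size 90864192
"""
pointer = parse_lfs_pointer(pointer_text)

# Stand-in payload (hypothetical) to show the check on data we control.
demo_data = b"example payload"
demo_pointer = {
    "version": pointer["version"],
    "algo": "sha256",
    "digest": hashlib.sha256(demo_data).hexdigest(),
    "size": len(demo_data),
}
print(pointer["size"], matches_pointer(demo_data, demo_pointer))
```

To verify a downloaded checkpoint, the same check would be run on the bytes of the real `model.safetensors` against `pointer`.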

last-checkpoint/optimizer.pt CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:168f24d2777ab0fe7d549b5cbc57061de102566dcc627477843b4c54d872bad6
 size 180607738

 version https://git-lfs.github.com/spec/v1
+oid sha256:f309c0f49859e92f45e91d15d010c986b3039d5aee5aa13a7a6a8b652636cbd3
 size 180607738

last-checkpoint/rng_state.pth CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d504d17d70cbf3694e88be868534a2e1dd8fdc9f2518c1e988f6b7f7aa42bff4
 size 14244

 version https://git-lfs.github.com/spec/v1
+oid sha256:d11ae26ad0553937353377362dcdfdfc64b495a56e520ee9d5cafa528daa8602
 size 14244

last-checkpoint/scaler.pt CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5d473dbc02d3ee7bd9625ff87a50bb89b5cfc0fd4775012466ec6bee2c111c2e
 size 988

 version https://git-lfs.github.com/spec/v1
+oid sha256:38e75fca916f8178bf9cd33054df9c31b71689bd5bddb2e11917964dcae00b45
 size 988

last-checkpoint/scheduler.pt CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c0b735d1c7f640d3ef4cf28a2af2021687802dd1b42e9507ad99c430c545922c
 size 1064

 version https://git-lfs.github.com/spec/v1
+oid sha256:6b8ed2557d72b721bbe933588bb84b4e8fd67437924faa2318d545f860f51f41
 size 1064

last-checkpoint/trainer_state.json CHANGED

@@ -2,9 +2,9 @@
   "best_global_step": null,
   "best_metric": null,
   "best_model_checkpoint": null,
-  "epoch": 2.0,
   "eval_steps": 5000,
-  "global_step": 4556,
   "is_hyper_param_search": false,
   "is_local_process_zero": true,
   "is_world_process_zero": true,
@@ -15,6 +15,22 @@
       "learning_rate": 0.0,
       "loss": 5.3655,
       "step": 1
     }
   ],
   "logging_steps": 5000,
@@ -29,7 +45,7 @@
         "should_evaluate": false,
         "should_log": false,
         "should_save": true,
-        "should_training_stop": false
       },
       "attributes": {}
     }

   "best_global_step": null,
   "best_metric": null,
   "best_model_checkpoint": null,
+  "epoch": 3.0,
   "eval_steps": 5000,
+  "global_step": 6834,
   "is_hyper_param_search": false,
   "is_local_process_zero": true,
   "is_world_process_zero": true,
       "learning_rate": 0.0,
       "loss": 5.3655,
       "step": 1
+    },
+    {
+      "epoch": 2.194907813871817,
+      "grad_norm": 5.572308540344238,
+      "learning_rate": 2.2396976347232385e-05,
+      "loss": 2.1423,
+      "step": 5000
+    },
+    {
+      "epoch": 2.194907813871817,
+      "eval_cosine_accuracy": 0.9412940740585327,
+      "eval_loss": 0.7694418430328369,
+      "eval_runtime": 32.5011,
+      "eval_samples_per_second": 292.451,
+      "eval_steps_per_second": 2.308,
+      "step": 5000
     }
   ],
   "logging_steps": 5000,
         "should_evaluate": false,
         "should_log": false,
         "should_save": true,
+        "should_training_stop": true
       },
       "attributes": {}
     }
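
The step and epoch counters in `trainer_state.json` can be cross-checked with a little arithmetic. The sketch below, which assumes a constant number of optimizer steps per epoch, derives that number from the old and new `global_step` values and confirms that step 5000 corresponds to the fractional epoch recorded in the log history. The variable names are illustrative only.

```python
# Values copied from the old and new sides of the trainer_state.json diff.
old_state = {"epoch": 2.0, "global_step": 4556}
new_state = {"epoch": 3.0, "global_step": 6834}

# Assumption: every epoch has the same number of optimizer steps.
steps_per_epoch = (new_state["global_step"] - old_state["global_step"]) / (
    new_state["epoch"] - old_state["epoch"]
)

# Fractional epoch at the step where logging and evaluation fired
# (logging_steps and eval_steps are both 5000).
logged_step = 5000
logged_epoch = logged_step / steps_per_epoch

print(steps_per_epoch)  # 2278.0 steps per epoch
print(logged_epoch)     # close to the 2.1949... recorded in log_history
```

With 2278 steps per epoch and an interval of 5000 steps, only one evaluation falls inside the three epochs (the next would be at step 10000), which is consistent with the single evaluation row in the training logs.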