sunitapalubanjar
/

sft_smolLM2_bad

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

Metrics Training metrics Community

sunitapalubanjar commited on Oct 23, 2025

Commit

917d098

·

verified ·

1 Parent(s): 328188a

End of training

Files changed (4) hide show

README.md +7 -7
model.safetensors +1 -1
tokenizer.json +1 -6
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [HuggingFaceTB/SmolLM2-135M](https://huggingface.co/HuggingFaceTB/SmolLM2-135M) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.4444
 ## Model description
@@ -47,12 +47,12 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 3.1885        | 0.32  | 200  | 3.3484          |
-| 2.9515        | 0.64  | 400  | 3.3259          |
-| 2.8161        | 0.96  | 600  | 3.2769          |
-| 1.8934        | 1.28  | 800  | 3.4197          |
-| 1.8574        | 1.6   | 1000 | 3.4540          |
-| 1.8602        | 1.92  | 1200 | 3.4444          |
 ### Framework versions

 This model is a fine-tuned version of [HuggingFaceTB/SmolLM2-135M](https://huggingface.co/HuggingFaceTB/SmolLM2-135M) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.7054
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 1.9605        | 0.32  | 200  | 3.6155          |
+| 2.3145        | 0.64  | 400  | 3.5834          |
+| 2.4709        | 0.96  | 600  | 3.4613          |
+| 1.5769        | 1.28  | 800  | 3.7328          |
+| 1.6233        | 1.6   | 1000 | 3.7155          |
+| 1.6837        | 1.92  | 1200 | 3.7054          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a9ffb252c9c15410d9751bb20a850c6f27ba9651286a8d1ef4e131ad5b29dd04
 size 538090408

 version https://git-lfs.github.com/spec/v1
+oid sha256:24c6bb5478fa298deb0aaac3d56a288fb71011627aa495e1aa6b81a2ca6d0589
 size 538090408

tokenizer.json CHANGED Viewed

@@ -1,11 +1,6 @@
 {
   "version": "1.0",
-  "truncation": {
-    "direction": "Right",
-    "max_length": 8192,
-    "strategy": "LongestFirst",
-    "stride": 0
-  },
   "padding": null,
   "added_tokens": [
     {

 {
   "version": "1.0",
+  "truncation": null,
   "padding": null,
   "added_tokens": [
     {

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ac70bba75a7c4f38612393155fccfcd986b01d69e8eda5190eb5c6a5a5bbf163
 size 5777

 version https://git-lfs.github.com/spec/v1
+oid sha256:940aaab86ff52d39f8d0c940ca8353b72f563447e9b5b9b3fba0e42411e87a76
 size 5777