BabyLM-community/nor-baseline-fast


jumelet committed on Jul 20, 2025

Commit 01e8cc3 · verified · 1 Parent(s): 7f86121

Files changed (3)

README.md +11 -11
model.safetensors +1 -1
tokenizer.json +2 -16

README.md CHANGED

@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.3872
+- Loss: 4.3822
 ## Model description
@@ -45,16 +45,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 5.8947        | 1.0   | 62   | 5.3381          |
-| 4.8971        | 2.0   | 124  | 5.0555          |
-| 4.6477        | 3.0   | 186  | 4.8338          |
-| 4.4732        | 4.0   | 248  | 4.6630          |
-| 4.3394        | 5.0   | 310  | 4.5579          |
-| 4.2559        | 6.0   | 372  | 4.4896          |
-| 4.184         | 7.0   | 434  | 4.4430          |
-| 4.1493        | 8.0   | 496  | 4.4121          |
-| 4.1168        | 9.0   | 558  | 4.3933          |
-| 4.11          | 10.0  | 620  | 4.3872          |
+| 5.8865        | 1.0   | 62   | 5.3373          |
+| 4.8902        | 2.0   | 124  | 5.0404          |
+| 4.6288        | 3.0   | 186  | 4.8065          |
+| 4.4524        | 4.0   | 248  | 4.6461          |
+| 4.3286        | 5.0   | 310  | 4.5491          |
+| 4.2488        | 6.0   | 372  | 4.4846          |
+| 4.177         | 7.0   | 434  | 4.4385          |
+| 4.143         | 8.0   | 496  | 4.4071          |
+| 4.1105        | 9.0   | 558  | 4.3883          |
+| 4.1035        | 10.0  | 620  | 4.3822          |
 ### Framework versions
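
For a sense of scale, the change in final evaluation loss can be converted to perplexity. The sketch below assumes the reported loss is a mean per-token cross-entropy in nats, which is the usual Trainer output for causal language models but is not stated in the README. The helper name `perplexity` is illustrative only.

```python
import math


def perplexity(loss_nats: float) -> float:
    """Convert a mean per-token cross-entropy (in nats) to perplexity."""
    return math.exp(loss_nats)


old_loss = 4.3872  # final validation loss before this commit
new_loss = 4.3822  # final validation loss after this commit

old_ppl = perplexity(old_loss)
new_ppl = perplexity(new_loss)

# Relative perplexity reduction; algebraically 1 - exp(new_loss - old_loss).
relative_drop = 1.0 - new_ppl / old_ppl

print(f"perplexity: {old_ppl:.2f} -> {new_ppl:.2f} ({relative_drop:.2%} lower)")
```

Under that assumption the two runs differ by roughly half a percent in perplexity, so the commit likely reflects a retrain with near-identical results rather than a change in model quality.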

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e186c2108988a415be5ad864881006a3a6cd8db704a939a9a201a1db5dc0c5f9
+oid sha256:8e3ba36ea7cf7ab30e81c8eb9b9d283474eb41b796376222f52c525a33d6e20c
 size 68273200
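
The diff above touches only the Git LFS pointer: the `oid` changes while `size` stays at 68273200 bytes, meaning the weights differ but the file is the same length. A minimal sketch of checking a downloaded file against such a pointer follows. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical helpers, not part of any Git LFS tooling.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """Return True if the bytes have the size and SHA-256 digest the pointer names."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


# The new pointer from this commit, copied from the diff.
new_pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:8e3ba36ea7cf7ab30e81c8eb9b9d283474eb41b796376222f52c525a33d6e20c\n"
    "size 68273200\n"
)

# Self-contained demonstration on a tiny payload instead of the real weights.
payload = b"example bytes"
demo_pointer = {
    "algo": "sha256",
    "digest": hashlib.sha256(payload).hexdigest(),
    "size": len(payload),
}
print(matches_pointer(payload, demo_pointer))
```

To check the real file, one would read `model.safetensors` in binary mode and pass its bytes with `new_pointer`. For much larger files, hashing in chunks would avoid loading everything into memory.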

tokenizer.json CHANGED

@@ -1,21 +1,7 @@
 {
   "version": "1.0",
-  "truncation": {
-    "direction": "Right",
-    "max_length": 512,
-    "strategy": "LongestFirst",
-    "stride": 0
-  },
-  "padding": {
-    "strategy": {
-      "Fixed": 512
-    },
-    "direction": "Right",
-    "pad_to_multiple_of": null,
-    "pad_id": 1,
-    "pad_type_id": 0,
-    "pad_token": "<pad>"
-  },
+  "truncation": null,
+  "padding": null,
   "added_tokens": [
     {
       "id": 0,
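
The tokenizer.json change removes the truncation and padding settings that had been serialized into the file (truncate to 512 tokens, pad to a fixed 512 with `<pad>`). With those baked in, a tokenizer loaded directly from this file would presumably pad and truncate every input to 512 tokens. Setting both to `null` leaves that choice to the caller. The sketch below reproduces the edit with the standard library only; `clear_truncation_and_padding` is a hypothetical helper, and how the author actually produced the file is not stated. The `tokenizers` library's `Tokenizer.no_truncation()` and `Tokenizer.no_padding()` methods are a common way to clear these settings before saving.

```python
import json


def clear_truncation_and_padding(tokenizer_json: str) -> str:
    """Return tokenizer.json text with truncation and padding set to null."""
    config = json.loads(tokenizer_json)
    config["truncation"] = None  # serialized as JSON null
    config["padding"] = None
    return json.dumps(config, indent=2, ensure_ascii=False)


# A cut-down stand-in for the old file; the real added_tokens list is elided.
before = json.dumps(
    {
        "version": "1.0",
        "truncation": {
            "direction": "Right",
            "max_length": 512,
            "strategy": "LongestFirst",
            "stride": 0,
        },
        "padding": {
            "strategy": {"Fixed": 512},
            "direction": "Right",
            "pad_to_multiple_of": None,
            "pad_id": 1,
            "pad_type_id": 0,
            "pad_token": "<pad>",
        },
        "added_tokens": [],
    }
)

after_text = clear_truncation_and_padding(before)
after = json.loads(after_text)
print(after_text)
```

Overwriting the existing keys instead of deleting them keeps their position in the file, which matches the diff: the two `null` entries sit where the old objects were.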