BabyLM-community/ALL-baseline-small

Files changed (5)

README.md CHANGED

@@ -12,9 +12,9 @@ should probably proofread and complete it, then remove this comment. -->
 # ALL-baseline-small
-This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
+This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 6.1017
+- Loss: 3.4514
 ## Model description
@@ -43,9 +43,9 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| 7.0297        | 1.0   | 55   | 6.1017          |
+| Training Loss | Epoch | Step  | Validation Loss |
+|:-------------:|:-----:|:-----:|:---------------:|
+| 3.8625        | 1.0   | 92700 | 3.4514          |
 ### Framework versions
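
For a sense of scale, the two validation losses in the README diff can be converted to perplexity. This is a minimal sketch, assuming the reported loss is a mean per-token cross-entropy in nats; the diff does not state this, and the helper name `perplexity` is illustrative only.

```python
import math


def perplexity(loss: float) -> float:
    """Convert a mean cross-entropy loss (assumed nats per token) to perplexity."""
    return math.exp(loss)


# Validation losses copied from the README diff.
old_loss = 6.1017  # removed line
new_loss = 3.4514  # added line

old_ppl = perplexity(old_loss)
new_ppl = perplexity(new_loss)

print(f"old perplexity: {old_ppl:.1f}")
print(f"new perplexity: {new_ppl:.1f}")
print(f"ratio old/new:  {old_ppl / new_ppl:.1f}x")
```

If the loss were instead measured in bits or averaged over a different unit, the conversion would differ, so treat the numbers as indicative only.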

merges.txt CHANGED

The diff for this file is too large to render. See raw diff

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:135f00d9dd41e3b369736d88bbfa584009ae5c9947f2563bbcb6ac1ad9b9273d
+oid sha256:25ed471e323b79212b5d0c9384a13e9bde3bca84fbae51a1d946dde30b9863d8
 size 118604848
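
The `model.safetensors` entry is a Git LFS pointer: only the `oid` (a SHA-256 of the file contents) changes, while `size` stays the same. A minimal sketch of checking a downloaded file against the new pointer follows; the function names and the local file path are assumptions for illustration, not part of this repository.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


def matches_pointer(path: Path, pointer: dict) -> bool:
    """Check a file's size and SHA-256 against the pointer's 'size' and 'oid'."""
    algo, _, expected = pointer["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported hash algorithm: {algo}")
    if path.stat().st_size != int(pointer["size"]):
        return False
    digest = hashlib.sha256()
    with path.open("rb") as f:
        # Read in 1 MiB chunks so large weight files are not loaded at once.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected


# Pointer contents copied from the added side of the diff.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:25ed471e323b79212b5d0c9384a13e9bde3bca84fbae51a1d946dde30b9863d8
size 118604848
"""

pointer = parse_lfs_pointer(NEW_POINTER)
```

With a local copy of the weights, `matches_pointer(Path("model.safetensors"), pointer)` would report whether the file corresponds to this commit; the path here is hypothetical.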

tokenizer.json CHANGED

The diff for this file is too large to render. See raw diff

vocab.json CHANGED

The diff for this file is too large to render. See raw diff