## Instruction model Vs. Humanized model

### Note

We observe an unexpectedly worse TriviaQA score compared to the base instruct model. A bit of training on a dataset such as squad-v2 quickly resolves this issue, and just one epoch results in a TriviaQA score far above the base instruct model (>21). We did not release this model due to worse scores on other metrics after this one-epoch training. If your specific use case requires a better grasp of trivia, feel free to train on squad-v2.

| Metric | SmolLM2-1.7B-Instruct | SmolLM2-1.7B-Humanized | Difference |
|:-----------------------------|:---------------------:|:----------------------:|:----------:|
| MMLU                         | **49.5**              | 48.8                   | -0.7       |
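If you do fine-tune on squad-v2 as suggested in the note, the records first need to be flattened into supervised (prompt, response) pairs. Below is a minimal sketch of that preprocessing step. The record schema mirrors the `squad_v2` dataset on the Hugging Face Hub (`context`, `question`, and `answers` with parallel `text`/`answer_start` lists); the prompt template and the refusal string for unanswerable questions are illustrative assumptions, not the exact setup used for SmolLM2.

```python
# Sketch: flatten SQuAD-v2-style records into (prompt, response) pairs
# for supervised fine-tuning. Schema follows the squad_v2 dataset card;
# the prompt template below is an assumption for illustration only.

def to_qa_pair(record):
    """Turn one SQuAD-v2-style record into a (prompt, response) pair.

    Unanswerable questions (empty ``answers["text"]``) map to an explicit
    refusal, so the model also sees examples of when not to answer.
    """
    prompt = f"Context: {record['context']}\nQuestion: {record['question']}"
    answers = record["answers"]["text"]
    response = answers[0] if answers else "The context does not contain the answer."
    return prompt, response


if __name__ == "__main__":
    record = {
        "context": "TriviaQA is a reading-comprehension benchmark.",
        "question": "What kind of benchmark is TriviaQA?",
        "answers": {"text": ["a reading-comprehension benchmark"], "answer_start": [13]},
    }
    print(to_qa_pair(record))
```

Mapping this function over the dataset (e.g. with `datasets.Dataset.map`) and training for a single epoch is all the note describes; keep an eye on the other benchmark scores, since that is why this variant was not released.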