HuggingFaceTB/SmolLM3-3B-Base

Text Generation

Transformers.js


loubnabnl (HF Staff) committed on Jul 7, 2025

Commit 0ff8759 · verified · 1 Parent(s): 951e73c

Update README.md

Files changed (1)

README.md +2 -3


@@ -31,11 +31,10 @@ language:
 SmolLM3 is a 3B parameter language model designed to push the boundaries of small models. It supports 6 languages, advanced reasoning and long context. SmolLM3 is a fully open model that offers strong performance at the 3B–4B scale.
-The model is a decoder-only transformer using GQA and NoRope, it was trained on 11.2T tokens with a staged curriculum of web, code, math and reasoning data. Post-training included midtraining on 100B reasoning followed by supervised fine-tuning and alignment via Anchored Preference Optimization.
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/61c141342aac764ce1654e43/Zcm_016pWeyFr_uIkT7Ki.png)
+The model is a decoder-only transformer using GQA and NoRope, it was trained on 11.2T tokens with a staged curriculum of web, code, math and reasoning data. Post-training included midtraining on 100B reasoning followed by supervised fine-tuning and alignment via Anchored Preference Optimization.
 ### Key features
 - Instruct model optimized for **hybrid reasoning**
 - **Fully open model**: open weights + full training details including public data mixture and training configs