SlimFactoryHub
/

SlimMoE-250M-SFT-v2

Text Generation

Text-Generation

Instruction Following

Model card Files Files and versions

Aispace2001 commited on Dec 31, 2025

Commit

fe8a438

·

verified ·

1 Parent(s): b2d539f

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -141,6 +141,8 @@ We would like to thank the dataset providers and the open-source community whose
 - **CAIS** for the **MMLU** dataset used for auxiliary knowledge and reasoning supervision.
 - **HuggingFaceTB** for the **OpenHermes-2.5-H4** dataset used in the final instruction refinement phase.
 - **Weights & Biases (W&B)** for logging and visualization tools used to monitor training progress.
 We also acknowledge the broader open-source research community for their continuous efforts in advancing efficient model architectures and training methodologies.

 - **CAIS** for the **MMLU** dataset used for auxiliary knowledge and reasoning supervision.
 - **HuggingFaceTB** for the **OpenHermes-2.5-H4** dataset used in the final instruction refinement phase.
 - **Weights & Biases (W&B)** for logging and visualization tools used to monitor training progress.
+- Additionally, we drew valuable insights from **The Smol Training Playbook: The Secrets to Building World-Class LLMs**, published by Hugging Face, which informed several practical decisions in our training and experimentation workflow.
+Playbook link: https://huggingfacetb-smol-training-playbook.hf.space/the-smol-training-playbook-the-secrets-to-building-world-class-llms.pdf
 We also acknowledge the broader open-source research community for their continuous efforts in advancing efficient model architectures and training methodologies.