---
base_model: facebook/MobileLLM-R1-140M-base
library_name: transformers
model_name: tiny-think-sft-math-stem-loss-dft-bf16-e2-bs8
tags:
- generated_from_trainer
- sft
- trl
licence: license
---

# Model Card for tiny-think-sft-math-stem-loss-dft-bf16-e2-bs8

This model is a fine-tuned version of [facebook/MobileLLM-R1-140M-base](https://huggingface.co/facebook/MobileLLM-R1-140M-base).
It has been trained using [TRL](https://github.com/huggingface/trl).

## Training procedure

This model was trained with SFT.

### Framework versions

- TRL: 0.26.2
- Transformers: 4.57.3
- Pytorch: 2.9.0+cu128
- Datasets: 4.4.2
- Tokenizers: 0.22.2
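
## Quick start

Since the card lists `transformers` as the library, the model can presumably be loaded with the standard text-generation pipeline. The following is a minimal sketch under stated assumptions, not a documented usage recipe: the Hub namespace in `MODEL_ID` is a placeholder (the card does not say who hosts the model), and the `Question: ... Answer:` prompt layout in `build_prompt` is a hypothetical format, because the prompt or chat template used during SFT is not documented here.

```python
# Placeholder repo id: the card does not state the Hub namespace, so replace
# "your-username" with the actual owner before running.
MODEL_ID = "your-username/tiny-think-sft-math-stem-loss-dft-bf16-e2-bs8"


def build_prompt(question: str) -> str:
    """Wrap a question in a plain-text prompt.

    Assumption: the training prompt format is not documented in this card,
    so this layout is illustrative only.
    """
    return f"Question: {question.strip()}\nAnswer:"


def generate(question: str, max_new_tokens: int = 256) -> str:
    """Generate a completion for `question` with the fine-tuned model."""
    # Imported lazily so build_prompt can be used without transformers installed.
    from transformers import pipeline

    generator = pipeline("text-generation", model=MODEL_ID)
    output = generator(
        build_prompt(question),
        max_new_tokens=max_new_tokens,
        return_full_text=False,  # return only the newly generated text
    )
    return output[0]["generated_text"]
```

Calling `generate("What is 12 * 7?")` downloads the weights on first use and returns the generated continuation as a string. If the published tokenizer ships a chat template, passing a list of chat messages to the pipeline may be a better fit than the plain-text prompt assumed here.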