This model is a version of Qwen3-0.6B-Base fine-tuned on a sub-sample of 6k pairs from the MetaMathQA dataset, using supervised fine-tuning (SFT) with the trl library. It is the first step of LaQwenTa, a lightweight STEM question-answering model for educational purposes.
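The training script is not part of this card, so the following is only a minimal sketch of how such an SFT run could look with trl. The Hub id `meta-math/MetaMathQA`, its `query`/`response` column names, the random seed, the output directory, and the default `SFTConfig` hyperparameters are all assumptions; the helper names `to_prompt_completion`, `subsample_indices`, and `train` are hypothetical.

```python
import random

BASE_MODEL = "Qwen/Qwen3-0.6B-Base"
DATASET = "meta-math/MetaMathQA"  # assumed Hub id of the dataset
SAMPLE_SIZE = 6_000               # "6k pairs" from the description


def to_prompt_completion(example):
    """Map one row to a prompt/completion pair.

    Assumes the dataset exposes 'query' and 'response' columns.
    """
    return {"prompt": example["query"], "completion": example["response"]}


def subsample_indices(n_rows, k=SAMPLE_SIZE, seed=0):
    """Pick k distinct row indices reproducibly (seed is an assumption)."""
    rng = random.Random(seed)
    return sorted(rng.sample(range(n_rows), min(k, n_rows)))


def train(output_dir="laqwenta_sft_model"):
    """Run SFT with trl defaults; not the author's actual configuration."""
    # Imported lazily so the helpers above work without these packages.
    from datasets import load_dataset
    from trl import SFTConfig, SFTTrainer

    ds = load_dataset(DATASET, split="train")
    ds = ds.select(subsample_indices(len(ds)))
    ds = ds.map(to_prompt_completion, remove_columns=ds.column_names)

    trainer = SFTTrainer(
        model=BASE_MODEL,
        train_dataset=ds,
        args=SFTConfig(output_dir=output_dir),
    )
    trainer.train()
    trainer.save_model(output_dir)
```

Calling `train()` downloads the base model and dataset and starts the run; the helpers can be inspected on their own to check the sampling and formatting before committing to a full training job.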
Model tree for jeanprbt/laqwenta_sft_model
- Base model: Qwen/Qwen3-0.6B-Base
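For trying the checkpoint, a hedged sketch using the standard transformers loading calls is shown here. It assumes the repository `jeanprbt/laqwenta_sft_model` hosts full causal-LM weights and a tokenizer, and that a plain question string is an acceptable prompt; the card does not document a prompt template, and `generate_answer` is a hypothetical helper name.

```python
MODEL_ID = "jeanprbt/laqwenta_sft_model"


def generate_answer(question, max_new_tokens=256):
    """Generate a continuation for a raw question string (assumed prompt format)."""
    # Imported lazily so that defining the helper does not require transformers.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

    inputs = tokenizer(question, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Keep only the newly generated tokens, dropping the echoed prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

A call such as `generate_answer("What is the derivative of x^2?")` would download the weights on first use; the quality and format of the output depend on how the model was actually trained.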