jeanprbt
/

laqwenta_sft_model

Text Generation

text-generation-inference

Model card Files Files and versions

laqwenta_sft_model / README.md

jeanprbt's picture

doc: update model card

d6c3e4b verified 11 months ago

|

history blame contribute delete

531 Bytes

	---
	library_name: transformers
	datasets:
	- meta-math/MetaMathQA
	base_model:
	- Qwen/Qwen3-0.6B-Base
	pipeline_tag: text-generation
	---

	This model is a fine-tuned version of [Qwen3-0.6B-Base](https://huggingface.co/Qwen/Qwen3-0.6B-Base), fine-tuned on a sub-sample of 6k pairs from [MetaMathQA](https://huggingface.co/datasets/meta-math/MetaMathQA) dataset using SFT and the [`trl`](https://huggingface.co/docs/trl/index) library. It is the first step of LaQwenTa, a light-weight STEM QA answering model for educational purposes.