---
base_model: Qwen/Qwen3-4B-Instruct-2507
datasets:
- u-10bei/structured_data_with_cot_dataset_512_v2
language: [en]
license: apache-2.0
pipeline_tag: text-generation
tags: [sft, structured-output]
---
# qwen3-4b-structeval-v9
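Simple SFT of `Qwen/Qwen3-4B-Instruct-2507` on the v2 dataset only (`u-10bei/structured_data_with_cot_dataset_512_v2`), trained for 2 epochs with `strip_cot`.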
- lr=5e-05, epochs=2, r=64, alpha=128
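
The hyperparameters above can be captured in a small config object. This is a minimal sketch, not the training script used for this model. It assumes `r` and `alpha` are the LoRA rank and LoRA alpha, which the card does not state explicitly. The class name `SFTHyperparams` and its field names are hypothetical. The scaling factor shown is the standard LoRA `alpha / r` ratio.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SFTHyperparams:
    """Hypothetical container for the values listed in this card."""

    learning_rate: float = 5e-5  # lr=5e-05
    num_epochs: int = 2          # epochs=2
    lora_r: int = 64             # assumption: "r" is the LoRA rank
    lora_alpha: int = 128        # assumption: "alpha" is the LoRA alpha

    @property
    def lora_scaling(self) -> float:
        # Standard LoRA scales the low-rank update by alpha / r.
        return self.lora_alpha / self.lora_r

    def lora_kwargs(self) -> dict:
        # Keyword names match the `r` and `lora_alpha` arguments of
        # peft.LoraConfig, should you build an adapter config from them.
        return {"r": self.lora_r, "lora_alpha": self.lora_alpha}


params = SFTHyperparams()
print(params.lora_scaling)   # 2.0
print(params.lora_kwargs())  # {'r': 64, 'lora_alpha': 128}
```

Under these assumptions the adapter update is scaled by 2.0, so the effective update magnitude depends on both `r` and `alpha`, not on `alpha` alone.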