miitarou's picture
Add README
c7e715f verified
metadata
base_model: Qwen/Qwen3-4B-Instruct-2507
datasets:
  - u-10bei/structured_data_with_cot_dataset_512_v2
language:
  - en
license: apache-2.0
pipeline_tag: text-generation
tags:
  - sft
  - structured-output

qwen3-4b-structeval-v9

Simple SFT: v2 dataset only, 2 epochs, strip_cot.

  • lr=5e-05, epochs=2, r=64, alpha=128