---
base_model: Qwen/Qwen3-4B-Instruct-2507
datasets:
- u-10bei/structured_data_with_cot_dataset_512_v2
language: [en]
license: apache-2.0
pipeline_tag: text-generation
tags: [sft, structured-output, qwen3]
---
# qwen3-4b-structeval-v13
Based on V11; the only change is seed=42.
- lr=1.5e-05, epochs=2, r=8, alpha=8, BS=32
- dropout=0, seed=42
- enable_thinking=False, strip_cot=True (strict)
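
For reference, the settings above can be collected in a small config object. This is only a sketch: the `RunConfig` class and its field names are hypothetical, and it assumes that `r`, `alpha`, and `dropout` are LoRA settings and that `BS` is the batch size. Under that assumption, the standard LoRA scaling factor `alpha / r` works out to 1.0 for this run.

```python
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class RunConfig:
    """Hyperparameters listed in this card (field names are illustrative)."""

    base_model: str = "Qwen/Qwen3-4B-Instruct-2507"
    learning_rate: float = 1.5e-5
    epochs: int = 2
    lora_r: int = 8            # assumption: "r" is the LoRA rank
    lora_alpha: int = 8        # assumption: "alpha" is the LoRA alpha
    lora_dropout: float = 0.0  # assumption: "dropout" is the LoRA dropout
    batch_size: int = 32       # assumption: "BS" is the batch size
    seed: int = 42             # the only value changed relative to V11
    enable_thinking: bool = False
    strip_cot: bool = True

    @property
    def lora_scaling(self) -> float:
        # Standard LoRA scales the low-rank update by alpha / r.
        return self.lora_alpha / self.lora_r


config = RunConfig()
print(asdict(config))
print("LoRA scaling:", config.lora_scaling)  # prints "LoRA scaling: 1.0"
```

Keeping the values in one frozen object makes it easy to diff one version against another when only a single field, such as the seed, changes.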