Qwen3-4B StructEval Exp15 (CSV Fix)

Exp13 (SFT+DPO) をベースに、CSV変換の問題を修正するための特化LoRAアダプター。

Overview

Item Detail
Base Model Qwen/Qwen3-4B-Instruct-2507
Parent Adapter Exp13 (SFT + DPO, merged)
Purpose CSV出力の「starting with header row」誤解釈を修正
Method Merged Exp13 + 追加LoRA fine-tuning

Training Details

Strategy

  • Exp13 DPOアダプターをベースモデルにマージ
  • マージ後のモデルに対して、CSV修正用の小規模LoRAを追加学習

Hyperparameters

Parameter Value
Learning Rate 5e-5
Epochs 15
Batch Size 1
Gradient Accumulation 1
LoRA r 8
LoRA alpha 16
LoRA dropout 0.05
Target Modules q/k/v/o_proj, gate/up/down_proj
Precision fp16
Max Sequence Length 2048
Optimizer AdamW
Seed 3407

Training Data

  • CSV修正用データセット(6サンプル)
  • 「starting with header row」を正しく解釈し、ヘッダー+データ行を出力する学習

Hardware

  • NVIDIA DGX Spark (GB10 Blackwell GPU)
  • CUDA 12.1

Inference

System Prompt

You are a structured data expert. Output the requested format directly without any explanation, preamble, or markdown code blocks. Do not write ```json, ```yaml, ```toml, ```xml, ```csv or similar. Output only the raw structured data.

Settings

Parameter Value
Temperature 1e-7 (near-greedy)
Max New Tokens 4096
do_sample False

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
import torch

base_model_id = "Qwen/Qwen3-4B-Instruct-2507"
adapter_id = "tenyyprn/qwen3-4b-structeval-exp15"

tokenizer = AutoTokenizer.from_pretrained(base_model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    base_model_id,
    torch_dtype=torch.float16,
    device_map="auto",
    trust_remote_code=True,
)
model = PeftModel.from_pretrained(model, adapter_id)
model = model.merge_and_unload()
model.eval()

Results

Ensemble (Exp13 + Exp15) の公式コンペスコア: 0.781733

Exp15 単体ローカル評価

Format Score
JSON High
YAML High
XML High
CSV Improved (fix applied)
TOML Challenging

Related Models

License

Apache 2.0 (following base model license)

Downloads last month
22
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for tenyyprn/qwen3-4b-structeval-exp15

Adapter
(5270)
this model