qwen3-4b-structeval-strategy2-revised-merged

This is a merged model fine-tuned from Qwen/Qwen3-4B-Instruct-2507 using the LoRA adapter from yuk1chan/qwen3-4b-structeval-yamlxml-boost-v2-lr6e-6.

🎯 Purpose

This merged model is designed for structured output generation (JSON / YAML / XML / TOML / CSV) and achieves a StructEval score of 0.82286.

It serves as the base model for Stage 1: YAML/XML Specialized SFT.

🔥 Key Features: Max Seq Len = 1024

Unlike typical fine-tuning that uses max_seq_length=512, this model was trained with max_seq_length=1024 to:

  1. Process longer sequences: Handle complex YAML/XML documents that exceed 512 tokens
  2. Improve long-context accuracy: Better performance on deeply nested structures
  3. Enhance parsing capabilities: More robust handling of long-formatted data
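To make the first point concrete: training-time truncation simply drops every token past the limit, so a long sample loses its tail. A minimal sketch (the token count of 800 is illustrative, not measured):

```python
def truncate(token_ids, max_seq_length):
    """Training-time truncation: tokens past max_seq_length are simply dropped."""
    return token_ids[:max_seq_length]

# Suppose a deeply nested YAML sample tokenizes to ~800 ids (illustrative).
sample = list(range(800))

assert len(truncate(sample, 512)) == 512   # the tail of the document is lost
assert len(truncate(sample, 1024)) == 800  # the whole sample fits
```

With a 512-token limit the model never sees the closing part of such documents during training; at 1024 it does.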

📊 Training Results (Strategy 2 Revised)

| Format  | Score             | Failures |
|---------|-------------------|----------|
| JSON    | 100.0% (50/50)    | 0        |
| YAML    | 97.1% (34/35)     | 1        |
| XML     | 90.0% (18/20)     | 2        |
| TOML    | 76.0% (19/25)     | 6        |
| CSV     | 95.0% (19/20)     | 1        |
| Overall | 0.82286 (141/150) | 9        |

βš™οΈ Training Configuration

Base Model

  • Model: Qwen/Qwen3-4B-Instruct-2507
  • Parameters: 4.05B (4,055,498,240 total)

LoRA Adapter

  • Adapter: yuk1chan/qwen3-4b-structeval-yamlxml-boost-v2-lr6e-6
  • Method: QLoRA (4-bit)
  • Max sequence length: 1024 🔥
  • Epochs: 1
  • Learning rate: 6e-6
  • LoRA: r=16, alpha=32
  • Training time: ~12 hours (T4 GPU)
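The hyperparameters above map onto a QLoRA setup along these lines. This is a sketch of the implied configuration, not the published training script; `target_modules` and `lora_dropout` are assumptions based on common Qwen fine-tuning practice and are not stated in the card:

```python
# Sketch of the adapter configuration implied by the card.
# "target_modules" and "lora_dropout" are ASSUMPTIONS, not documented values.
lora_config = {
    "r": 16,
    "lora_alpha": 32,            # effective LoRA scaling = alpha / r = 2.0
    "target_modules": ["q_proj", "k_proj", "v_proj", "o_proj"],  # assumed
    "lora_dropout": 0.05,        # assumed
    "task_type": "CAUSAL_LM",
    "load_in_4bit": True,        # QLoRA: base weights quantized to 4-bit
}

assert lora_config["lora_alpha"] / lora_config["r"] == 2.0
```

These keys correspond to the fields of peft's `LoraConfig` plus the 4-bit loading flag used for QLoRA.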

Data Pipeline

  • Dataset: structeval_runB_yamlxml_boost_v2.jsonl (25,000 samples)
  • Cleaning: CoT removal, code fence removal, leading phrase removal
  • u-10bei series: extract only the content after the "Output:" marker
  • daichira series: 2x boost
  • YAML/XML: 2x boost
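A minimal sketch of what these cleaning and boosting steps might look like. The actual pipeline script is not published with the card, so `clean_sample` and the boost logic below are illustrative:

```python
import re

def clean_sample(text: str) -> str:
    """Illustrative cleaning: 'Output:' extraction plus code-fence removal."""
    # u-10bei series: keep only the content after the "Output:" marker.
    if "Output:" in text:
        text = text.split("Output:", 1)[1]
    # Strip surrounding code fences such as ```yaml ... ``` .
    text = re.sub(r"^```[a-zA-Z]*\n|```$", "", text.strip(), flags=re.MULTILINE)
    return text.strip()

raw = "Here is the result.\nOutput:\n```yaml\nkey: value\n```"
assert clean_sample(raw) == "key: value"

# "2x boost": samples of the weak formats are simply duplicated in the mix.
samples = [{"fmt": "yaml"}, {"fmt": "json"}, {"fmt": "xml"}]
boosted = samples + [s for s in samples if s["fmt"] in ("yaml", "xml")]
assert len(boosted) == 5
```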

Format Distribution (Estimated)

  • YAML: ~45% (11,250 samples)
  • XML: ~20% (5,000 samples)
  • JSON: ~15% (3,750 samples)
  • TOML: ~10% (2,500 samples)
  • CSV: ~10% (2,500 samples)
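The estimated sample counts above follow directly from the percentages applied to the 25,000-sample dataset:

```python
total = 25_000
dist = {"YAML": 0.45, "XML": 0.20, "JSON": 0.15, "TOML": 0.10, "CSV": 0.10}
counts = {fmt: round(total * share) for fmt, share in dist.items()}

assert counts == {"YAML": 11250, "XML": 5000, "JSON": 3750, "TOML": 2500, "CSV": 2500}
assert sum(counts.values()) == total
```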

💻 Usage

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

model_id = "yuk1chan/qwen3-4b-structeval-strategy2-revised-merged"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,
    device_map="auto",
)

# Inference
prompt = "Generate YAML code for..."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=1024)
result = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(result)
```
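Since the model targets machine-readable formats, it can help to parse the generated text immediately and fall back to re-prompting on failure. This is an illustrative pattern, not part of the card; JSON is used here for a dependency-free check:

```python
import json

def parse_or_none(text: str):
    """Return the parsed object, or None if the model output is not valid JSON."""
    try:
        return json.loads(text)
    except json.JSONDecodeError:
        return None  # e.g. strip stray prose around the payload and retry

generated = '{"server": {"host": "localhost", "port": 8080}}'
data = parse_or_none(generated)
assert data is not None and data["server"]["port"] == 8080
assert parse_or_none("not json") is None
```

The same pattern applies to YAML or TOML with `yaml.safe_load` or `tomllib.loads` in place of `json.loads`.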

📚 Training Strategy

Strategy 2 Revised: YAML/XML Clean Expansion

This model incorporates three key improvements:

1. u-10bei series: "Output:" extraction
  - Extract only the content after the "Output:" marker
  - Removes the explanation-before-output tendency
2. daichira series: 2x boost
  - Boosts "Return ONLY" pattern examples
  - Strengthens direct output without explanation
3. YAML/XML: 2x boost
  - Doubles the YAML/XML training data
  - Improves performance on the weakest formats

🎯 Expected Improvements from Max Seq Len = 1024

Compared to max_seq_length=512 models:
| Aspect           | Max Seq Len = 512 | Max Seq Len = 1024 |
|------------------|-------------------|--------------------|
| Long sequences   | ❌ Truncated      | ✅ Fully processed |
| Complex YAML/XML | ⚠️ Partial        | ✅ Complete        |
| Deep nesting     | ⚠️ Limited        | ✅ Better          |
| Training time    | 6-8 hours         | 12 hours           |
🔬 Next Steps

This merged model will be used as the base for:

1. Stage 1: YAML/XML Specialized SFT
  - Further enhance YAML/XML to 99-100%
  - Exclude TOML from training data
  - Target: YAML 99%+, XML 96%+
2. Stage 2: TOML Refinement with hard4k
  - Use only daichira/structured-hard-sft-4k (TOML 100%)
  - Low learning rate (3e-6) to preserve base capabilities
  - Target: TOML 90%+

📊 Validation

- Training Loss: ~1.10 β†’ 0.83 (converged well)
- Validation Loss: ~2.04 β†’ 1.32 (converged well)
- All-masked samples: 0% after filtering
- Valid ratio: ~0.60-0.80 (healthy distribution)

βš–οΈ License

Apache 2.0

πŸ“ Citation

If you use this model, please cite:

```bibtex
@misc{qwen3-4b-structeval-strategy2-revised-merged,
  title={Qwen3-4B StructEval Strategy 2 Revised (Merged)},
  author={yuk1chan},
  year={2026},
  url={https://huggingface.co/yuk1chan/qwen3-4b-structeval-strategy2-revised-merged},
}
```

---
Trained with passion over 12 hours using Max Seq Len 1024! 🔥

Base Model: Qwen/Qwen3-4B-Instruct-2507
LoRA Adapter: yuk1chan/qwen3-4b-structeval-yamlxml-boost-v2-lr6e-6