lin_s3w50_dpow50

This is a fully merged model based on Qwen/Qwen3-4B-Instruct-2507, optimized for structured output generation (JSON / YAML / XML / TOML / CSV).

Merge Strategy

Linear interpolation (50:50) of two merged models:

merged = 0.5 * sft_s3_merged + 0.5 * dpo_merged

Source Adapters

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

model_id = "DLNorb/lin_s3w50_dpow50"

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
    trust_remote_code=True,
)

messages = [{"role": "user", "content": "Convert this to JSON: name=Alice, age=30"}]
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
    output = model.generate(**inputs, max_new_tokens=2048, do_sample=False)
print(tokenizer.decode(output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))

Sources & Terms (IMPORTANT)

Training data:

Compliance: Users must comply with each dataset's license (including copyright notice) and the base model's original terms of use.

Downloads last month
31
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for DLNorb/lin_s3w50_dpow50

Finetuned
(874)
this model

Datasets used to train DLNorb/lin_s3w50_dpow50