Anonymous Chainer SFT Checkpoint

This repository contains a LoRA adapter checkpoint for the Chainer policy after supervised fine-tuning.

Base model

  • Qwen/Qwen3-4B-Instruct-2507

Artifact type

  • PEFT LoRA adapter
  • Inference-only checkpoint export

Notes

  • This repository is provided as anonymous supplementary material for peer review.
  • The checkpoint is released for verification and follow-up research use.
  • The repository intentionally omits author-identifying metadata.

Loading example

from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_model_id = "Qwen/Qwen3-4B-Instruct-2507"
adapter_repo_id = "Anonymous5345345/anonymous-chainer-sft"

tokenizer = AutoTokenizer.from_pretrained(base_model_id)
base_model = AutoModelForCausalLM.from_pretrained(base_model_id)
model = PeftModel.from_pretrained(base_model, adapter_repo_id)
Downloads last month
35
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Anonymous5345345/anonymous-chainer-sft

Adapter
(5513)
this model