Qwen3_8B_sft / README.md
tinyrolls's picture
Upload Qwen3_8B_sft
60361d8 verified
metadata
language:
  - en
  - zh
  - ja
  - fr
  - es
license: apache-2.0
tags:
  - role-playing
  - dialogue
  - multilingual
  - dpo
  - sft
datasets:
  - tinyrolls/RoleVerse

Qwen3_8B_20260509_144913

Checkpoint from the RoleVerse project — a multilingual benchmark for social reasoning through same-universe role-playing across 5 languages (EN, ZH, JA, FR, ES).

This checkpoint was produced by the RoleVerse training pipeline (SFT and/or DPO). See the project repo and dataset for details on how it was trained.

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "tinyrolls/Qwen3_8B_20260509_144913"
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")