tinyrolls/RoleVerse
Preview • Updated • 30.5k • 155
Checkpoint from the RoleVerse project — a multilingual benchmark for social reasoning through same-universe role-playing across 5 languages (EN, ZH, JA, FR, ES).
This checkpoint was produced by the RoleVerse training pipeline (SFT and/or DPO). See the project repo and dataset for details on how it was trained.
tinyrolls/R1_Qwen3_8B_dpofrom transformers import AutoModelForCausalLM, AutoTokenizer
model_id = "tinyrolls/R1_Qwen3_8B_dpo"
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")