tttx/r1-trajectories-collection-round-2
Viewer • Updated • 2.91k • 52 • 1
How to use tttx/25k_sft_5ep_021025 with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("deepseek-ai/DeepSeek-R1-Distill-Qwen-32B")
model = PeftModel.from_pretrained(base_model, "tttx/25k_sft_5ep_021025")This model is a fine-tuned version of deepseek-ai/Deepseek-R1-Distill-Qwen-32B on the tttx/r1-trajectories-arcagi-barc, the tttx/r1-masked-arcagi-v1, the tttx/r1-barc-r1-feb-6, the tttx/r1-masked-feb-6-p2, the tttx/r1-masked-feb-6-p1, the tttx/r1-trajectories-collection-round-2, the tttx/feb7-masked-trajectories-hp12, the tttx/feb7-masked-trajectories-hp13, the tttx/regular-masked-3k-r1-020925, the tttx/regular-arcagi-3k-r1-020925, the tttx/feb10-qwen-collect-2.2k and the tttx/feb10-qwen-collect-2.7k-hp12 datasets.
More information needed
More information needed
More information needed
The following hyperparameters were used during training: