from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("deepseek-ai/DeepSeek-R1-Distill-Qwen-7B")
model = PeftModel.from_pretrained(base_model, "dev-store/sc-dev-p002")

This model is a fine-tuned version of deepseek-ai/DeepSeek-R1-Distill-Qwen-7B on the sc_preference_v2 dataset.
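Once the adapter is attached, the model can be used for generation like any other causal language model. The following is a minimal sketch, not the authors' recommended inference setup. It assumes the tokenizer is loaded from the base model repository, since this card does not say whether the adapter repository ships its own, and it passes the prompt as plain text without a chat template. The helper names `load_model` and `generate_reply` are hypothetical.

```python
def load_model(base_id="deepseek-ai/DeepSeek-R1-Distill-Qwen-7B",
               adapter_id="dev-store/sc-dev-p002"):
    """Load the base model, attach the PEFT adapter, and return (model, tokenizer)."""
    # Imported lazily so generate_reply can be reused without these packages.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Assumption: the tokenizer comes from the base model repository.
    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base_model = AutoModelForCausalLM.from_pretrained(base_id)
    model = PeftModel.from_pretrained(base_model, adapter_id)
    model.eval()  # inference mode: disables dropout
    return model, tokenizer


def generate_reply(model, tokenizer, prompt, max_new_tokens=256):
    """Tokenize a plain-text prompt, generate, and decode the first sequence."""
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # The decoded text includes the prompt followed by the generated continuation.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

To try it, call `model, tokenizer = load_model()` and then `generate_reply(model, tokenizer, "your prompt here")`. Loading a 7B model this way runs on CPU in full precision by default, so in practice you may want to pass a dtype or device placement to `from_pretrained`.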
More information needed
The following hyperparameters were used during training:
Base model: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
# Gated model: Login with a HF token with gated access permission
hf auth login