tttx/3k-forcing-1500-mask25-022425-step2
Viewer • Updated • 1.5k • 224
How to use tttx/model_3k_force_1500_step3_mask25_022425 with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("deepseek-ai/DeepSeek-R1-Distill-Qwen-32B")
model = PeftModel.from_pretrained(base_model, "tttx/model_3k_force_1500_step3_mask25_022425")This model is a fine-tuned version of tttx/sft-32b-020925-19k-5ep on the tttx/3k-forcing-1500-mask25-022425-step2 dataset.
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
Base model
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B