tttx/ttt-bigestrun-021225-night-big-collated
Viewer • Updated • 251 • 5
How to use tttx/models-ttt-diff-buffer-021325-step4 with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("deepseek-ai/DeepSeek-R1-Distill-Qwen-32B")
model = PeftModel.from_pretrained(base_model, "tttx/models-ttt-diff-buffer-021325-step4")This model is a fine-tuned version of tttx/models-ttt-diff-buffer-step3-021325 on the tttx/ttt-bigestrun-021225-night-big-collated dataset.
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
Base model
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("deepseek-ai/DeepSeek-R1-Distill-Qwen-32B") model = PeftModel.from_pretrained(base_model, "tttx/models-ttt-diff-buffer-021325-step4")