Submission for task submission_dpo_final_0807

🧠 Fine-tuned using LoRA on a dynamic dataset generated from LLaMA.

  • Task ID: sim-task-dpo-0807-final
  • Repo: submission_dpo_final_0807
  • Loss: 3.021432399749756
  • Timestamp: 2025-07-08T14:48:45.001303
Downloads last month

-

Downloads are not tracked for this model. How to track
Safetensors
Model size
4.19M params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for raniero/submission_dpo_final_0807

Finetuned
(1096)
this model