Submission for task submission_dpo_test_llama2

🧠 Fine-tuned using LoRA on a dynamic dataset generated from LLaMA.

  • Task ID: test-dpo-002
  • Repo: raniero/submission_dpo_test_llama2
  • Loss: 4.466731508572896
  • Timestamp: 2025-07-07T07:07:08.438253
  • Format: Safetensors
  • Model size: 4.19M params
  • Tensor type: F32
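
A parameter count of 4.19M suggests the repository holds only the LoRA adapter weights, not a full model. The sketch below shows how such a count arises. The base model dimensions (32 layers, hidden size 4096, as in LLaMA-2-7B), the adapted modules (`q_proj` and `v_proj`), and the rank (8) are all assumptions for illustration. The card states none of them, and other configurations can produce the same total. The helper `lora_param_count` is hypothetical.

```python
def lora_param_count(num_layers, target_shapes, rank):
    """Count the parameters LoRA adds to a stack of identical layers.

    Each adapted weight of shape (d_out, d_in) gains two low-rank factors:
    A with shape (rank, d_in) and B with shape (d_out, rank).
    """
    per_layer = sum(rank * (d_in + d_out) for d_out, d_in in target_shapes)
    return num_layers * per_layer


# Assumed configuration (not stated in the card): LLaMA-2-7B dimensions,
# adapters on q_proj and v_proj only, rank 8.
count = lora_param_count(
    num_layers=32,
    target_shapes=[(4096, 4096), (4096, 4096)],  # q_proj, v_proj
    rank=8,
)
print(f"{count:,} adapter parameters (~{count / 1e6:.2f}M)")
```

Under these assumptions the total is 4,194,304, which rounds to the 4.19M reported above. To confirm the actual configuration, check the adapter's own configuration file.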
