raniero
/

submission_dpo_final_0807

Model card Files Files and versions

Submission for task `submission_dpo_final_0807`

🧠 Fine-tuned using LoRA on a dynamic dataset generated from LLaMA.

Task ID: sim-task-dpo-0807-final
Repo: submission_dpo_final_0807
Loss: 3.021432399749756
Timestamp: 2025-07-08T14:48:45.001303

Downloads last month: -; Downloads are not tracked for this model. How to track

Safetensors

Model size

4.19M params

Tensor type

F32

·

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for raniero/submission_dpo_final_0807

Base model

meta-llama/Llama-2-7b-hf

Finetuned

(974)

this model