submission_dpo_final_0807
๐ง Fine-tuned using LoRA on a dynamic dataset generated from LLaMA.
sim-task-dpo-0807-final
3.021432399749756
-
Files info
Base model