submission_test_dpo_final
๐ง Fine-tuned using LoRA on a dynamic dataset generated from LLaMA.
sim-task-dpo-test-final
2.242447626590729
-
Files info
Base model