submission_test_dpo_0807_002
๐ง Fine-tuned using LoRA on a dynamic dataset generated from LLaMA.
sim-task-dpo-0807-002
3.021432399749756
-
Files info
Base model