submission_test_dpo_0807_001
๐ง Fine-tuned using LoRA on a dynamic dataset generated from LLaMA.
sim-task-dpo-0807-001
4.867627779642741
-
Files info
Base model