Update README.md

README.md (changed):

@@ -54,7 +54,7 @@ The EchoLLaMA pipeline integrates four specialized models:
 The LLaMA-3.2-1B-Instruct model was fine-tuned using:

 - **Technique**: Direct Preference Optimization (DPO) with LoRA
-- **Dataset**: 2000 samples from COCO 2017 processed with DETR,
+- **Dataset**: 2000 samples from COCO 2017 processed with DETR, and Moondream
 - **Chosen Responses**: Generated by DeepSeek-V3-0324
 - **Rejected Responses**: Generated by pre-fine-tuned LLaMA-3.2-1B-Instruct
 - **Training Parameters**:
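The diff above describes DPO fine-tuning on chosen/rejected response pairs (chosen from DeepSeek-V3-0324, rejected from the pre-fine-tuned LLaMA-3.2-1B-Instruct). As a minimal sketch of the DPO objective behind that setup — not the repository's actual training code; `dpo_loss` and its log-probability inputs are illustrative:

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-pair DPO loss: -log sigmoid(beta * (chosen margin - rejected margin)).

    Each argument is the summed token log-probability of the full response
    under either the trainable policy or the frozen reference model.
    beta controls how far the policy may drift from the reference.
    """
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp
    logits = beta * (chosen_margin - rejected_margin)
    # -log(sigmoid(x)) written as log(1 + exp(-x)); stable enough for a sketch
    return math.log1p(math.exp(-logits))

# At initialization the policy equals the reference, so every margin is zero
# and the loss is log(2); it falls as the policy learns to prefer the chosen
# response more strongly than the reference does.
at_init = dpo_loss(-10.0, -12.0, -10.0, -12.0)   # policy == reference
improved = dpo_loss(-8.0, -14.0, -10.0, -12.0)   # chosen up, rejected down
```

In the actual pipeline these log-probabilities would come from the LoRA-adapted policy and a frozen copy of the base model; libraries such as TRL wrap this computation, but the loss itself reduces to the expression above.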