This repository contains 10 expert models fine-tuned via low-rank adaptation (LoRA).
- **Fine-tuning Framework:** llama-factory
- **Adaptation Technique:** LoRA
- **Training Hardware:** 8×A100-80GB GPUs
- **Note:** Deploying a 2B model requires only 12GB of VRAM. For optimal performance, we recommend using an RTX 3090/4090 (24GB) or a comparable GPU.

A visualization of the performance (ranks) across various datasets shows that each expert model excels in its respective domain.

vLLM supports dynamic LoRA switching, allowing seamless adaptation of different expert models with minimal computational overhead, enabling cost-effective optimization.
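As a sketch of how such switching can look in practice, vLLM's OpenAI-compatible server can register several LoRA adapters alongside one base model and select an adapter per request. The model name, adapter names, and paths below are placeholders, not the actual artifacts in this repository:

```shell
# Serve the base model with LoRA support enabled; each expert adapter
# is registered under its own name (names and paths are hypothetical).
vllm serve base-2b-model \
  --enable-lora \
  --lora-modules math-expert=/adapters/math code-expert=/adapters/code

# Route a request to a specific expert by passing its adapter name
# as the "model" field; the base weights stay loaded throughout.
curl http://localhost:8000/v1/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "math-expert", "prompt": "2+2=", "max_tokens": 8}'
```

Because only the small adapter weights differ between experts, switching among them adds little memory or latency compared with loading ten separate full models.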