spankevich
/

llm-course-hw3-dora

Text Generation

text-generation-inference

Model card Files Files and versions

spankevich commited on Mar 29, 2025

Commit

002d4fd

·

verified ·

1 Parent(s): c11e870

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -11,7 +11,7 @@ base_model:
 # Model Card for Model ID
  it was used to fine-tune OuteAI/Lite-Oute-1-300M-Instruct for tweet tone classification problem.
- Default model achieved 0.08 f1-score, while fine-tuned version achieved 0.53 f1-score in less than 8 minutes of fine-tuning on a single A100
 #### Prameters
@@ -22,7 +22,7 @@ DoRA was used with r=8 and alpha=16 to fine-tune "k_proj", "v_proj".
 #### Training parameters
 BATCH_SIZE = 32
 LEARNING_RATE = 3e-4
-NUM_EPOCHS = 1
 #### Metrics

 # Model Card for Model ID
  it was used to fine-tune OuteAI/Lite-Oute-1-300M-Instruct for tweet tone classification problem.
+ Default model achieved 0.08 f1-score, while fine-tuned version achieved 0.51 f1-score in less than 15 minutes of fine-tuning on a single A100
 #### Prameters
 #### Training parameters
 BATCH_SIZE = 32
 LEARNING_RATE = 3e-4
+NUM_EPOCHS = 2
 #### Metrics