Update README.md
Browse files
README.md
CHANGED
|
@@ -11,7 +11,7 @@ base_model:
|
|
| 11 |
# Model Card for Model ID
|
| 12 |
|
| 13 |
it was used to fine-tune OuteAI/Lite-Oute-1-300M-Instruct for tweet tone classification problem.
|
| 14 |
-
Default model achieved 0.08 f1-score, while fine-tuned version achieved 0.
|
| 15 |
|
| 16 |
|
| 17 |
#### Prameters
|
|
@@ -22,7 +22,7 @@ DoRA was used with r=8 and alpha=16 to fine-tune "k_proj", "v_proj".
|
|
| 22 |
#### Training parameters
|
| 23 |
BATCH_SIZE = 32
|
| 24 |
LEARNING_RATE = 3e-4
|
| 25 |
-
NUM_EPOCHS =
|
| 26 |
|
| 27 |
|
| 28 |
#### Metrics
|
|
|
|
| 11 |
# Model Card for Model ID
|
| 12 |
|
| 13 |
it was used to fine-tune OuteAI/Lite-Oute-1-300M-Instruct for tweet tone classification problem.
|
| 14 |
+
Default model achieved 0.08 f1-score, while fine-tuned version achieved 0.51 f1-score in less than 15 minutes of fine-tuning on a single A100
|
| 15 |
|
| 16 |
|
| 17 |
#### Prameters
|
|
|
|
| 22 |
#### Training parameters
|
| 23 |
BATCH_SIZE = 32
|
| 24 |
LEARNING_RATE = 3e-4
|
| 25 |
+
NUM_EPOCHS = 2
|
| 26 |
|
| 27 |
|
| 28 |
#### Metrics
|