Update README.md
Browse files
README.md
CHANGED
|
@@ -134,9 +134,9 @@ This SFT approach enables Alpie-Core to deliver reliable, aligned, and context-a
|
|
| 134 |
|
| 135 |
- **Hardware**: 8× NVIDIA H100-80GB GPUs
|
| 136 |
- **Fine-tuning Method**: LoRA/QLoRA with the following configuration:
|
| 137 |
-
- LoRA Alpha:
|
| 138 |
- LoRA Dropout: 0.05
|
| 139 |
-
- LoRA Rank:
|
| 140 |
- **Quantization**: 4-bit NF4 + Double Quantization + FP16 compute
|
| 141 |
- **Dataset Domains**: Mathematics, coding, reasoning, science, general knowledge, competitive exams, Indian context + law, multilingual (Hindi and Hinglish)
|
| 142 |
- **Synthetic Data Advantage**: +15-20% performance boost in STEM & coding domains
|
|
|
|
| 134 |
|
| 135 |
- **Hardware**: 8× NVIDIA H100-80GB GPUs
|
| 136 |
- **Fine-tuning Method**: LoRA/QLoRA with the following configuration:
|
| 137 |
+
- LoRA Alpha: 16
|
| 138 |
- LoRA Dropout: 0.05
|
| 139 |
+
- LoRA Rank: 16
|
| 140 |
- **Quantization**: 4-bit NF4 + Double Quantization + FP16 compute
|
| 141 |
- **Dataset Domains**: Mathematics, coding, reasoning, science, general knowledge, competitive exams, Indian context + law, multilingual (Hindi and Hinglish)
|
| 142 |
- **Synthetic Data Advantage**: +15-20% performance boost in STEM & coding domains
|