Update README.md
Browse files
README.md
CHANGED
|
@@ -302,7 +302,7 @@ Users should:
|
|
| 302 |
|
| 303 |
### Environmental Impact
|
| 304 |
|
| 305 |
-
- **Training**: ~6-8 hours on
|
| 306 |
- **Carbon Footprint**: Estimated ~5-10 kg CO2eq (depends on energy source)
|
| 307 |
- **Inference**: Efficient at 1.3B parameters, suitable for edge deployment
|
| 308 |
|
|
@@ -371,7 +371,7 @@ The model uses the **Asterisk** architecture, which combines:
|
|
| 371 |
- ✅ **Full Hybrid**: All 16 layers use ASPP + Attention
|
| 372 |
- ✅ **Bilingual**: Serbian + English capabilities
|
| 373 |
- ✅ **Reasoning**: Math, code, and general reasoning
|
| 374 |
-
- ✅ **Fast Training**: ~6-8 hours on
|
| 375 |
- ✅ **Low Memory**: ~3GB inference, ~20GB training per GPU
|
| 376 |
|
| 377 |
## Hardware Requirements
|
|
|
|
| 302 |
|
| 303 |
### Environmental Impact
|
| 304 |
|
| 305 |
+
- **Training**: ~6-8 hours on 1x RTX PRO 6000 GPUs
|
| 306 |
- **Carbon Footprint**: Estimated ~5-10 kg CO2eq (depends on energy source)
|
| 307 |
- **Inference**: Efficient at 1.3B parameters, suitable for edge deployment
|
| 308 |
|
|
|
|
| 371 |
- ✅ **Full Hybrid**: All 16 layers use ASPP + Attention
|
| 372 |
- ✅ **Bilingual**: Serbian + English capabilities
|
| 373 |
- ✅ **Reasoning**: Math, code, and general reasoning
|
| 374 |
+
- ✅ **Fast Training**: ~6-8 hours on 1x RTX PRO 6000
|
| 375 |
- ✅ **Low Memory**: ~3GB inference, ~20GB training per GPU
|
| 376 |
|
| 377 |
## Hardware Requirements
|