NoesisLab
/

Geilim-1B-SR-Instruct

Text Generation

hybrid-architecture

Model card Files Files and versions

OzTianlu commited on Feb 1

Commit

50d539a

·

verified ·

1 Parent(s): ec2aaa8

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -302,7 +302,7 @@ Users should:
 ### Environmental Impact
-- **Training**: ~6-8 hours on 2x A100 GPUs
 - **Carbon Footprint**: Estimated ~5-10 kg CO2eq (depends on energy source)
 - **Inference**: Efficient at 1.3B parameters, suitable for edge deployment
@@ -371,7 +371,7 @@ The model uses the **Asterisk** architecture, which combines:
 - ✅ **Full Hybrid**: All 16 layers use ASPP + Attention
 - ✅ **Bilingual**: Serbian + English capabilities
 - ✅ **Reasoning**: Math, code, and general reasoning
-- ✅ **Fast Training**: ~6-8 hours on 2x A100
 - ✅ **Low Memory**: ~3GB inference, ~20GB training per GPU
 ## Hardware Requirements

 ### Environmental Impact
+- **Training**: ~6-8 hours on 1x RTX PRO 6000 GPUs
 - **Carbon Footprint**: Estimated ~5-10 kg CO2eq (depends on energy source)
 - **Inference**: Efficient at 1.3B parameters, suitable for edge deployment
 - ✅ **Full Hybrid**: All 16 layers use ASPP + Attention
 - ✅ **Bilingual**: Serbian + English capabilities
 - ✅ **Reasoning**: Math, code, and general reasoning
+- ✅ **Fast Training**: ~6-8 hours on 1x RTX PRO 6000
 - ✅ **Low Memory**: ~3GB inference, ~20GB training per GPU
 ## Hardware Requirements