Austin207
/

Map-NEO

@@ -53,11 +53,11 @@ model-index:
 ## Key Features
-- 🚀 **Efficient Training**: Trained on RTX 5070 (8GB VRAM) in ~4 hours
-- 📏 **Extended Context**: 16,384 token context window (16x typical small models)
-- 🧠 **Memory Efficient**: Only 1.3GB VRAM for 1,800 tokens inference
-- ⚡ **Fast Inference**: ~10 tokens/second on consumer GPU
-- 🎯 **High Quality Data**: Trained on curated RefinedWeb subset
 ## Architecture Details
@@ -136,7 +136,7 @@ model-index:
 ## Usage
 ### Quick Start
----
 import torch
 from transformers import AutoTokenizer
 from model_neo import NeoMini, NeoMiniConfig
@@ -157,11 +157,11 @@ input_ids = tokenizer.encode(prompt, return_tensors="pt")
 with torch.no_grad():
     output = model.generate(input_ids, max_length=100, temperature=0.8)
 print(tokenizer.decode(output))
----
 ### Interactive Chat
----
 python interactive_chat.py
----
 ### Generation Parameters
 - **Temperature**: 0.7-0.9 for creative tasks, 0.3-0.5 for factual
@@ -204,7 +204,7 @@ python interactive_chat.py
 ## Environmental Impact
 ### Carbon Footprint
-- **Training Hardware**: Single RTX 5070 (200W)
 - **Training Time**: 4 hours
 - **Estimated CO₂**: ~0.3 kg CO₂ equivalent
 - **Efficiency**: 253M parameters per 0.3 kg CO₂
@@ -216,7 +216,7 @@ python interactive_chat.py
 ## Citation
----
 @misc{mapneo_mini_2025,
   title={MAP-NEO Mini: An Efficient 253M Parameter Language Model},
   author={[Antony Austin]},
@@ -224,12 +224,12 @@ python interactive_chat.py
   howpublished={\url{https://huggingface.co/[Austin207]/map-neo-mini}},
   note={Trained on NVIDIA RTX 5070 with RefinedWeb data}
 }
----
 ## Technical Details
 ### Files Structure
----
 map-neo-mini/
 ├── config.json                 # Model configuration
 ├── pytorch_model.bin           # Model weights
@@ -239,7 +239,7 @@ map-neo-mini/
 ├── vocab.json                  # Vocabulary
 ├── merges.txt                  # BPE merges
 └── model_neo.py               # Model architecture code
----
 ### Hardware Requirements
 - **Minimum**: 4GB VRAM for inference

 ## Key Features
+-  **Efficient Training**: Trained on RTX 5070 (8GB VRAM) in ~4 hours
+-  **Extended Context**: 16,384 token context window (16x typical small models)
+-  **Memory Efficient**: Only 1.3GB VRAM for 1,800 tokens inference
+-  **Fast Inference**: ~10 tokens/second on consumer GPU
+-  **High Quality Data**: Trained on curated RefinedWeb subset
 ## Architecture Details
 ## Usage
 ### Quick Start
+```
 import torch
 from transformers import AutoTokenizer
 from model_neo import NeoMini, NeoMiniConfig
 with torch.no_grad():
     output = model.generate(input_ids, max_length=100, temperature=0.8)
 print(tokenizer.decode(output))
+```
 ### Interactive Chat
+```
 python interactive_chat.py
+```
 ### Generation Parameters
 - **Temperature**: 0.7-0.9 for creative tasks, 0.3-0.5 for factual
 ## Environmental Impact
 ### Carbon Footprint
+- **Training Hardware**: Single RTX 5070 Laptop GPU (100W)
 - **Training Time**: 4 hours
 - **Estimated CO₂**: ~0.3 kg CO₂ equivalent
 - **Efficiency**: 253M parameters per 0.3 kg CO₂
 ## Citation
+```
 @misc{mapneo_mini_2025,
   title={MAP-NEO Mini: An Efficient 253M Parameter Language Model},
   author={[Antony Austin]},
   howpublished={\url{https://huggingface.co/[Austin207]/map-neo-mini}},
   note={Trained on NVIDIA RTX 5070 with RefinedWeb data}
 }
+```
 ## Technical Details
 ### Files Structure
+```
 map-neo-mini/
 ├── config.json                 # Model configuration
 ├── pytorch_model.bin           # Model weights
 ├── vocab.json                  # Vocabulary
 ├── merges.txt                  # BPE merges
 └── model_neo.py               # Model architecture code
+```
 ### Hardware Requirements
 - **Minimum**: 4GB VRAM for inference