Update README.md
Browse files
README.md
CHANGED
|
@@ -24,9 +24,11 @@ tags:
|
|
| 24 |
|
| 25 |
# π³ BaobabAI v0.2
|
| 26 |
|
| 27 |
-
**Africa's first continental AI model** β
|
|
|
|
| 28 |
|
| 29 |
-
Named after the Baobab tree β the Tree of Life across Africa.
|
|
|
|
| 30 |
|
| 31 |
---
|
| 32 |
|
|
@@ -34,7 +36,7 @@ Named after the Baobab tree β the Tree of Life across Africa. Lives 2,000+ yea
|
|
| 34 |
- 3x more training data β 86,604 training pairs
|
| 35 |
- 5x more training steps β 500 steps
|
| 36 |
- 4 new languages added
|
| 37 |
-
- Training loss improved
|
| 38 |
- New data sources: Glot500 + CulturaX + Masakhane
|
| 39 |
|
| 40 |
---
|
|
@@ -50,10 +52,10 @@ Named after the Baobab tree β the Tree of Life across Africa. Lives 2,000+ yea
|
|
| 50 |
| Amharic | Ethiopia | 35M+ |
|
| 51 |
| Somali | Somalia/Kenya | 20M+ |
|
| 52 |
| Xhosa | South Africa | 27M+ |
|
|
|
|
| 53 |
| Shona | Zimbabwe | 15M+ |
|
| 54 |
| Luganda | Uganda | 8M+ |
|
| 55 |
| Lingala | DRC/Congo | 70M+ |
|
| 56 |
-
| Zulu | South Africa | 27M+ |
|
| 57 |
| Wolof | Senegal/Gambia | 12M+ |
|
| 58 |
| Oromo | Ethiopia/Kenya | 40M+ |
|
| 59 |
|
|
@@ -63,12 +65,12 @@ Named after the Baobab tree β the Tree of Life across Africa. Lives 2,000+ yea
|
|
| 63 |
| Parameter | Value |
|
| 64 |
|-----------|-------|
|
| 65 |
| Base Model | Llama 3.2 3B Instruct |
|
| 66 |
-
| Training
|
| 67 |
-
| Training
|
| 68 |
-
| Final
|
| 69 |
| Method | QLoRA (r=16) via Unsloth |
|
| 70 |
| Hardware | NVIDIA Tesla T4 |
|
| 71 |
-
| Data
|
| 72 |
|
| 73 |
---
|
| 74 |
|
|
@@ -85,8 +87,8 @@ Named after the Baobab tree β the Tree of Life across Africa. Lives 2,000+ yea
|
|
| 85 |
```python
|
| 86 |
from transformers import AutoModelForCausalLM, AutoTokenizer
|
| 87 |
|
| 88 |
-
model = AutoModelForCausalLM.from_pretrained("okaforpascal40/BaobabAI-v0.2")
|
| 89 |
tokenizer = AutoTokenizer.from_pretrained("okaforpascal40/BaobabAI-v0.2")
|
|
|
|
| 90 |
```
|
| 91 |
|
| 92 |
---
|
|
@@ -100,9 +102,9 @@ tokenizer = AutoTokenizer.from_pretrained("okaforpascal40/BaobabAI-v0.2")
|
|
| 100 |
---
|
| 101 |
|
| 102 |
## π¨πΏβπ» Built By
|
| 103 |
-
|
| 104 |
-
|
| 105 |
-
|
| 106 |
|
| 107 |
---
|
| 108 |
|
|
|
|
| 24 |
|
| 25 |
# π³ BaobabAI v0.2
|
| 26 |
|
| 27 |
+
**Africa's first continental AI model** β now with 15 languages
|
| 28 |
+
and 86,604 training pairs.
|
| 29 |
|
| 30 |
+
Named after the Baobab tree β the Tree of Life across Africa.
|
| 31 |
+
Lives 2,000+ years. Feeds entire ecosystems. That is the vision.
|
| 32 |
|
| 33 |
---
|
| 34 |
|
|
|
|
| 36 |
- 3x more training data β 86,604 training pairs
|
| 37 |
- 5x more training steps β 500 steps
|
| 38 |
- 4 new languages added
|
| 39 |
+
- Training loss improved: 2.81 β 2.07
|
| 40 |
- New data sources: Glot500 + CulturaX + Masakhane
|
| 41 |
|
| 42 |
---
|
|
|
|
| 52 |
| Amharic | Ethiopia | 35M+ |
|
| 53 |
| Somali | Somalia/Kenya | 20M+ |
|
| 54 |
| Xhosa | South Africa | 27M+ |
|
| 55 |
+
| Zulu | South Africa | 27M+ |
|
| 56 |
| Shona | Zimbabwe | 15M+ |
|
| 57 |
| Luganda | Uganda | 8M+ |
|
| 58 |
| Lingala | DRC/Congo | 70M+ |
|
|
|
|
| 59 |
| Wolof | Senegal/Gambia | 12M+ |
|
| 60 |
| Oromo | Ethiopia/Kenya | 40M+ |
|
| 61 |
|
|
|
|
| 65 |
| Parameter | Value |
|
| 66 |
|-----------|-------|
|
| 67 |
| Base Model | Llama 3.2 3B Instruct |
|
| 68 |
+
| Training Pairs | 86,604 |
|
| 69 |
+
| Training Steps | 500 |
|
| 70 |
+
| Final Loss | 2.07 |
|
| 71 |
| Method | QLoRA (r=16) via Unsloth |
|
| 72 |
| Hardware | NVIDIA Tesla T4 |
|
| 73 |
+
| Data Sources | Masakhane, Glot500, CulturaX |
|
| 74 |
|
| 75 |
---
|
| 76 |
|
|
|
|
| 87 |
```python
|
| 88 |
from transformers import AutoModelForCausalLM, AutoTokenizer
|
| 89 |
|
|
|
|
| 90 |
tokenizer = AutoTokenizer.from_pretrained("okaforpascal40/BaobabAI-v0.2")
|
| 91 |
+
model = AutoModelForCausalLM.from_pretrained("okaforpascal40/BaobabAI-v0.2")
|
| 92 |
```
|
| 93 |
|
| 94 |
---
|
|
|
|
| 102 |
---
|
| 103 |
|
| 104 |
## π¨πΏβπ» Built By
|
| 105 |
+
Pascal Okafor Ogbonna | SabiFlow Technologies Limited
|
| 106 |
+
π [baobabai.dev](https://baobabai.dev) Β·
|
| 107 |
+
[GitHub](https://github.com/okaforpascal400/BaobabAI)
|
| 108 |
|
| 109 |
---
|
| 110 |
|