okaforpascal40 commited on
Commit
2e90ab7
Β·
verified Β·
1 Parent(s): 4401ccb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -12
README.md CHANGED
@@ -24,9 +24,11 @@ tags:
24
 
25
  # 🌳 BaobabAI v0.2
26
 
27
- **Africa's first continental AI model** β€” built to serve 1.4 billion Africans in their own languages.
 
28
 
29
- Named after the Baobab tree β€” the Tree of Life across Africa. Lives 2,000+ years. Feeds entire ecosystems. That is the vision for this model.
 
30
 
31
  ---
32
 
@@ -34,7 +36,7 @@ Named after the Baobab tree β€” the Tree of Life across Africa. Lives 2,000+ yea
34
  - 3x more training data β€” 86,604 training pairs
35
  - 5x more training steps β€” 500 steps
36
  - 4 new languages added
37
- - Training loss improved from 2.81 β†’ 2.07
38
  - New data sources: Glot500 + CulturaX + Masakhane
39
 
40
  ---
@@ -50,10 +52,10 @@ Named after the Baobab tree β€” the Tree of Life across Africa. Lives 2,000+ yea
50
  | Amharic | Ethiopia | 35M+ |
51
  | Somali | Somalia/Kenya | 20M+ |
52
  | Xhosa | South Africa | 27M+ |
 
53
  | Shona | Zimbabwe | 15M+ |
54
  | Luganda | Uganda | 8M+ |
55
  | Lingala | DRC/Congo | 70M+ |
56
- | Zulu | South Africa | 27M+ |
57
  | Wolof | Senegal/Gambia | 12M+ |
58
  | Oromo | Ethiopia/Kenya | 40M+ |
59
 
@@ -63,12 +65,12 @@ Named after the Baobab tree β€” the Tree of Life across Africa. Lives 2,000+ yea
63
  | Parameter | Value |
64
  |-----------|-------|
65
  | Base Model | Llama 3.2 3B Instruct |
66
- | Training pairs | 86,604 |
67
- | Training steps | 500 |
68
- | Final loss | 2.07 |
69
  | Method | QLoRA (r=16) via Unsloth |
70
  | Hardware | NVIDIA Tesla T4 |
71
- | Data sources | Masakhane, Glot500, CulturaX |
72
 
73
  ---
74
 
@@ -85,8 +87,8 @@ Named after the Baobab tree β€” the Tree of Life across Africa. Lives 2,000+ yea
85
  ```python
86
  from transformers import AutoModelForCausalLM, AutoTokenizer
87
 
88
- model = AutoModelForCausalLM.from_pretrained("okaforpascal40/BaobabAI-v0.2")
89
  tokenizer = AutoTokenizer.from_pretrained("okaforpascal40/BaobabAI-v0.2")
 
90
  ```
91
 
92
  ---
@@ -100,9 +102,9 @@ tokenizer = AutoTokenizer.from_pretrained("okaforpascal40/BaobabAI-v0.2")
100
  ---
101
 
102
  ## πŸ‘¨πŸΏβ€πŸ’» Built By
103
- **Pascal Okafor Ogbonna**
104
- Founder & CTO, SabiFlow Technologies Limited
105
- 🌍 Nigeria | github.com/okaforpascal400/BaobabAI
106
 
107
  ---
108
 
 
24
 
25
  # 🌳 BaobabAI v0.2
26
 
27
+ **Africa's first continental AI model** β€” now with 15 languages
28
+ and 86,604 training pairs.
29
 
30
+ Named after the Baobab tree β€” the Tree of Life across Africa.
31
+ Lives 2,000+ years. Feeds entire ecosystems. That is the vision.
32
 
33
  ---
34
 
 
36
  - 3x more training data β€” 86,604 training pairs
37
  - 5x more training steps β€” 500 steps
38
  - 4 new languages added
39
+ - Training loss improved: 2.81 β†’ 2.07
40
  - New data sources: Glot500 + CulturaX + Masakhane
41
 
42
  ---
 
52
  | Amharic | Ethiopia | 35M+ |
53
  | Somali | Somalia/Kenya | 20M+ |
54
  | Xhosa | South Africa | 27M+ |
55
+ | Zulu | South Africa | 27M+ |
56
  | Shona | Zimbabwe | 15M+ |
57
  | Luganda | Uganda | 8M+ |
58
  | Lingala | DRC/Congo | 70M+ |
 
59
  | Wolof | Senegal/Gambia | 12M+ |
60
  | Oromo | Ethiopia/Kenya | 40M+ |
61
 
 
65
  | Parameter | Value |
66
  |-----------|-------|
67
  | Base Model | Llama 3.2 3B Instruct |
68
+ | Training Pairs | 86,604 |
69
+ | Training Steps | 500 |
70
+ | Final Loss | 2.07 |
71
  | Method | QLoRA (r=16) via Unsloth |
72
  | Hardware | NVIDIA Tesla T4 |
73
+ | Data Sources | Masakhane, Glot500, CulturaX |
74
 
75
  ---
76
 
 
87
  ```python
88
  from transformers import AutoModelForCausalLM, AutoTokenizer
89
 
 
90
  tokenizer = AutoTokenizer.from_pretrained("okaforpascal40/BaobabAI-v0.2")
91
+ model = AutoModelForCausalLM.from_pretrained("okaforpascal40/BaobabAI-v0.2")
92
  ```
93
 
94
  ---
 
102
  ---
103
 
104
  ## πŸ‘¨πŸΏβ€πŸ’» Built By
105
+ Pascal Okafor Ogbonna | SabiFlow Technologies Limited
106
+ 🌍 [baobabai.dev](https://baobabai.dev) ·
107
+ [GitHub](https://github.com/okaforpascal400/BaobabAI)
108
 
109
  ---
110