okaforpascal40 commited on
Commit
4401ccb
Β·
verified Β·
1 Parent(s): 6b28cef

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +106 -13
README.md CHANGED
@@ -1,21 +1,114 @@
1
  ---
2
- base_model: unsloth/llama-3.2-3b-instruct-unsloth-bnb-4bit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3
  tags:
4
- - text-generation-inference
5
- - transformers
6
- - unsloth
7
  - llama
8
- license: apache-2.0
9
- language:
10
- - en
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
11
  ---
12
 
13
- # Uploaded finetuned model
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
14
 
15
- - **Developed by:** okaforpascal40
16
- - **License:** apache-2.0
17
- - **Finetuned from model :** unsloth/llama-3.2-3b-instruct-unsloth-bnb-4bit
 
 
 
 
 
 
 
 
 
 
 
18
 
19
- This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
20
 
21
- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
 
1
  ---
2
+ language:
3
+ - ha
4
+ - ig
5
+ - pcm
6
+ - yo
7
+ - sw
8
+ - am
9
+ - so
10
+ - xh
11
+ - sn
12
+ - lg
13
+ - ln
14
+ - zu
15
+ - wo
16
+ - om
17
+ license: apache-2.0
18
  tags:
19
+ - africa
 
 
20
  - llama
21
+ - fine-tuned
22
+ - multilingual
23
+ ---
24
+
25
+ # 🌳 BaobabAI v0.2
26
+
27
+ **Africa's first continental AI model** β€” built to serve 1.4 billion Africans in their own languages.
28
+
29
+ Named after the Baobab tree β€” the Tree of Life across Africa. Lives 2,000+ years. Feeds entire ecosystems. That is the vision for this model.
30
+
31
+ ---
32
+
33
+ ## πŸš€ What's New in v0.2
34
+ - 3x more training data β€” 86,604 training pairs
35
+ - 5x more training steps β€” 500 steps
36
+ - 4 new languages added
37
+ - Training loss improved from 2.81 β†’ 2.07
38
+ - New data sources: Glot500 + CulturaX + Masakhane
39
+
40
  ---
41
 
42
+ ## 🌍 Languages Supported
43
+ | Language | Country | Speakers |
44
+ |----------|---------|----------|
45
+ | Hausa | Nigeria/Niger/Chad | 70M+ |
46
+ | Yoruba | Nigeria/Benin/Togo | 45M+ |
47
+ | Igbo | Nigeria | 30M+ |
48
+ | Nigerian Pidgin | West Africa | 75M+ |
49
+ | Swahili | Kenya/Tanzania | 200M+ |
50
+ | Amharic | Ethiopia | 35M+ |
51
+ | Somali | Somalia/Kenya | 20M+ |
52
+ | Xhosa | South Africa | 27M+ |
53
+ | Shona | Zimbabwe | 15M+ |
54
+ | Luganda | Uganda | 8M+ |
55
+ | Lingala | DRC/Congo | 70M+ |
56
+ | Zulu | South Africa | 27M+ |
57
+ | Wolof | Senegal/Gambia | 12M+ |
58
+ | Oromo | Ethiopia/Kenya | 40M+ |
59
 
60
+ ---
61
+
62
+ ## πŸ“Š Training Details
63
+ | Parameter | Value |
64
+ |-----------|-------|
65
+ | Base Model | Llama 3.2 3B Instruct |
66
+ | Training pairs | 86,604 |
67
+ | Training steps | 500 |
68
+ | Final loss | 2.07 |
69
+ | Method | QLoRA (r=16) via Unsloth |
70
+ | Hardware | NVIDIA Tesla T4 |
71
+ | Data sources | Masakhane, Glot500, CulturaX |
72
+
73
+ ---
74
 
75
+ ## πŸ’‘ What BaobabAI Can Do
76
+ - Summarize African news articles in local languages
77
+ - Identify African languages automatically
78
+ - Answer questions about African topics
79
+ - Process Nigerian Pidgin English natively
80
+ - Classify African content by category
81
+
82
+ ---
83
+
84
+ ## πŸ”§ Quick Start
85
+ ```python
86
+ from transformers import AutoModelForCausalLM, AutoTokenizer
87
+
88
+ model = AutoModelForCausalLM.from_pretrained("okaforpascal40/BaobabAI-v0.2")
89
+ tokenizer = AutoTokenizer.from_pretrained("okaforpascal40/BaobabAI-v0.2")
90
+ ```
91
+
92
+ ---
93
+
94
+ ## πŸ—ΊοΈ Roadmap
95
+ - v0.3 β€” 20 languages, 200,000+ training pairs
96
+ - v0.5 β€” Upgrade to 8B model
97
+ - v1.0 β€” Live REST API, enterprise ready
98
+ - v2.0 β€” Continental dominance, 50+ languages
99
+
100
+ ---
101
+
102
+ ## πŸ‘¨πŸΏβ€πŸ’» Built By
103
+ **Pascal Okafor Ogbonna**
104
+ Founder & CTO, SabiFlow Technologies Limited
105
+ 🌍 Nigeria | github.com/okaforpascal400/BaobabAI
106
+
107
+ ---
108
+
109
+ ## πŸ“œ License
110
+ Apache 2.0 β€” free to use, modify and build on.
111
+
112
+ ---
113
 
114
+ *The tree is planted. The continent will feel its shade.* 🌳