You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Aura-1B

A 1.0B-parameter multilingual base LLM covering 21 African languages (plus English, French, Portuguese, Arabic for transfer). Trained from scratch with a Llama-3-style architecture (RMSNorm, RoPE, grouped-query attention, SwiGLU) on a temperature-balanced FineWeb2 mixture.

Architecture


Parameters	1.0B (1,013,280,000)
Layers	36
Hidden dim	1280
Attention heads	20 (KV heads: 4, GQA)
FFN intermediate	5120
Context length	1024
Vocab size	64000
RoPE theta	500000
Training step	99,999

Languages

afr_Latn
amh_Ethi
arb_Arab
bem_Latn
eng_Latn
fon_Latn
fra_Latn
hau_Latn
ibo_Latn
kin_Latn
lin_Latn
lug_Latn
nya_Latn
plt_Latn
por_Latn
sna_Latn
som_Latn
sot_Latn
swh_Latn
tir_Ethi
tsn_Latn
wol_Latn
xho_Latn
yor_Latn
zul_Latn

Quick start

git clone https://huggingface.co/WakandaAI/Aura-1B
cd Aura-1B
pip install torch tokenizers safetensors
python generate.py --prompt "<s><|yor_Latn|>Kaabo, mo jẹ awoṣe ede." -n 4

Or from Python:

from inference import load_model, generate

model, tokenizer, config = load_model(".")
out = generate(
    model, tokenizer,
    prompt="<s><|swh_Latn|>Habari yako rafiki?",
    max_new_tokens=128,
    temperature=0.8,
    top_p=0.9,
)
print(out[0])

Prompt format

Every prompt should start with <s> (BOS) followed by a language token of the form <|{lang}_{Script}|> to condition generation on the target language. See tokenizer.json for the full list of language tokens.

Examples:

<s><|eng_Latn|>The quick brown fox
<s><|hau_Latn|>Sannu, yaya kake?
<s><|amh_Ethi|>ሰላም

Interactive mode

For exploring the model, run the REPL:

python generate.py --interactive

Then type prompts at the >>> prompt. Empty line or Ctrl-D exits.

Example session:

>>> <s><|swh_Latn|>Nairobi ni mji mkuu wa
[sample 0]
Nairobi ni mji mkuu wa Kenya na kuna wageni wengi ambao hutembelea pia.
Unaweza kupata historia ya utalii wa ndani ya nchi hii...

Adjust sampling on the command line:

python generate.py --interactive --temperature 0.7 --top-p 0.95 --max-new-tokens 256

Files

File	Purpose
`model.safetensors`	Model weights (preferred format)
`model.pt`	Same weights as a torch checkpoint (fallback)
`config.json`	Architecture config in plain JSON
`tokenizer.json`	ByteLevel BPE tokenizer (64000 vocab)
`tokenizer_config.json`	HuggingFace tokenizer metadata
`inference.py`	`load_model()` + `generate()` library
`generate.py`	CLI wrapper
`llama3.py`, `model_factory.py`, `kvcache.py`	Model definition

Limitations

This is a base model, not chat- or instruction-tuned. It will continue text in the style of its training corpus (web text in the prompted language). For instruction following or chat, fine-tune on an instruction dataset.

Quality varies by language; lower-resource languages in the training mixture (e.g. Bemba, Tswana) will produce lower-quality continuations than higher-resource ones (English, Swahili, Yoruba).

Citation

If you use this model, please cite WakandaAI. Details TBA.

Downloads last month: 2

Safetensors

Model size

1B params

Tensor type

F16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including WakandaAI/Aura-1B

Aura Family of LLMs

Collection

4 items • Updated 30 days ago