How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="mwitiderrick/SwahiliInstruct-v0.2")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("mwitiderrick/SwahiliInstruct-v0.2")
model = AutoModelForCausalLM.from_pretrained("mwitiderrick/SwahiliInstruct-v0.2")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
Quick Links

SwahiliInstruct-v0.2

This is a Mistral model that has been fine-tuned on the Swahili Alpaca dataset for 3 epochs.

Prompt Template

### Maelekezo:

{query}

### Jibu:
<Leave new line for model to respond> 

Usage

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("mwitiderrick/SwahiliInstruct-v0.2")
model = AutoModelForCausalLM.from_pretrained("mwitiderrick/SwahiliInstruct-v0.2", device_map="auto")
query = "Nipe maagizo ya kutengeneza mkate wa mandizi"
text_gen = pipeline(task="text-generation", model=model, tokenizer=tokenizer, max_length=200, do_sample=True, repetition_penalty=1.1)
output = text_gen(f"### Maelekezo:\n{query}\n### Jibu:\n")
print(output[0]['generated_text'])


"""
 Maagizo ya kutengeneza mkate wa mandazi:
1. Preheat tanuri hadi 375°F (190°C).
2. Paka sufuria ya uso na siagi au jotoa sufuria.
3. Katika bakuli la chumvi, ongeza viungo vifuatavyo: unga, sukari ya kahawa, chumvi, mdalasini, na unga wa kakao.
Koroga mchanganyiko pamoja na mbegu za kikombe 1 1/2 za mtindi wenye jamii na hatua ya maji nyepesi.
4. Kando ya uwanja, changanya zaini ya yai 2
"""

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 54.25
AI2 Reasoning Challenge (25-Shot) 55.20
HellaSwag (10-Shot) 78.22
MMLU (5-Shot) 50.30
TruthfulQA (0-shot) 57.08
Winogrande (5-shot) 73.24
GSM8k (5-shot) 11.45
Downloads last month
10
Safetensors
Model size
7B params
Tensor type
F16
·
Inference Providers NEW
Input a message to start chatting with mwitiderrick/SwahiliInstruct-v0.2.

Model tree for mwitiderrick/SwahiliInstruct-v0.2

Finetuned
(1088)
this model
Quantizations
1 model

Dataset used to train mwitiderrick/SwahiliInstruct-v0.2

Evaluation results