ayymen/Weblate-Translations
Viewer • Updated • 11.7M • 782 • 19
How to use boffire/kabyle-sentence-transformer-mpnet with sentence-transformers:
from sentence_transformers import SentenceTransformer
model = SentenceTransformer("boffire/kabyle-sentence-transformer-mpnet")
sentences = [
"That is a happy person",
"That is a happy dog",
"That is a very happy person",
"Today is a sunny day"
]
embeddings = model.encode(sentences)
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [4, 4]A sentence embedding model specifically fine-tuned for Kabyle (Taqbaylit) - English cross-lingual semantic similarity.
| Attribute | Value |
|---|---|
| Base model | sentence-transformers/paraphrase-multilingual-mpnet-base-v2 |
| Fine-tuning data | ~2.5M unique EN–KAB parallel sentences |
| Embedding dimension | 768 |
| Training framework | SentenceTransformers |
| Training time | ~1h 16min (1 epoch, 15,593 steps) |
| Final loss | 0.043 (started at 0.278) |
| Source | Pairs | Description |
|---|---|---|
| NLLB (cleaned) | ~2.35M | Diverse domain parallel corpus |
| Tatoeba + CS | ~202K | Community translations + software localization |
| Weblate | ~9K | FLOSS UI strings |
| LibreTranslate | ~449 | User-reviewed translations |
Compared to the base paraphrase-multilingual-mpnet-base-v2 (untrained):
| Metric | Base | This Model | Gain |
|---|---|---|---|
| Avg. cosine similarity (EN<->KAB) | 0.278 | 0.857 | +58 points |
from sentence_transformers import SentenceTransformer
model = SentenceTransformer("boffire/kabyle-sentence-transformer-mpnet")
# Embed English and Kabyle
sentences = ["Hello!", "Azul!"]
embeddings = model.encode(sentences)
# Cross-lingual similarity
from sklearn.metrics.pairwise import cosine_similarity
sim = cosine_similarity([embeddings[0]], [embeddings[1]])
print(sim)
Davlan/afro-xlmr-large backbone for African-specific pretrainingAvgCosineEvaluator instead of correlation-based metricsIf you use this model, please cite:
@misc{kabyle-st-mpnet,
title={Kabyle Sentence Transformer},
author={boffire},
year={2026},
howpublished={\url{https://huggingface.co/boffire/kabyle-sentence-transformer-mpnet}}
}