brazembed-pt-br-sts โ€” BrazEmbed-PT-BR component (Semantic similarity (STS))

A component of BrazEmbed-PT-BR, the contamination-clean ~110M Brazilian-Portuguese embedding system (task-routed, MTEB(por) mean_16 = 0.6567, #1 in the ~100M class). This standalone SentenceTransformer (Brazilian BERTimbau + the Semantic similarity (STS) clean weight-soup) serves the Semantic similarity (STS) tasks.

from sentence_transformers import SentenceTransformer
m = SentenceTransformer("tardellirs/brazembed-pt-br-sts")   # mean-pooling, L2-normalized, no instruction prefix

Use it directly for Semantic similarity (STS), or via the router (https://github.com/tardellirs/brazembed-pt-br โ†’ route.py). For one general model, use tardellirs/brazembed-pt-br. License MIT. Benchmark: MTEB(por) (leaderboard).

Downloads last month
12
Safetensors
Model size
0.1B params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for tardellirs/brazembed-pt-br-sts

Finetuned
(211)
this model

Collection including tardellirs/brazembed-pt-br-sts