brazembed-pt-br-clfclu โ€” BrazEmbed-PT-BR component (Classification & Clustering)

A component of BrazEmbed-PT-BR, the contamination-clean ~110M Brazilian-Portuguese embedding system (task-routed, MTEB(por) mean_16 = 0.6567, #1 in the ~100M class). This standalone SentenceTransformer (Brazilian BERTimbau + the Classification & Clustering clean weight-soup) serves the Classification & Clustering tasks.

from sentence_transformers import SentenceTransformer
m = SentenceTransformer("tardellirs/brazembed-pt-br-clfclu")   # mean-pooling, L2-normalized, no instruction prefix

Use it directly for Classification & Clustering, or via the router (https://github.com/tardellirs/brazembed-pt-br โ†’ route.py). For one general model, use tardellirs/brazembed-pt-br. License MIT. Benchmark: MTEB(por) (soon).

Downloads last month
12
Safetensors
Model size
0.1B params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for tardellirs/brazembed-pt-br-clfclu

Finetuned
(210)
this model

Collection including tardellirs/brazembed-pt-br-clfclu