MTEB(por) Leaderboard
Massive Text Embedding Benchmark for Brazilian Portuguese
None defined yet.
A public benchmark for evaluating text embedding models on Brazilian Portuguese, built on top of the mteb library.
Two channels β pick whichever fits:
Required for a submission:
model_id (HF repo path or vendor product name)We re-run a sample of each submission to verify before merging.
Open a GitHub Issue with the task template describing the dataset, license, size, and discrimination evidence. A task is accepted if it's native PT-BR (not machine-translated), has clear licensing, and discriminates between models.
Tardelli Stekel β IFSP, SΓ£o Paulo, Brazil
βοΈ stekel@ifsp.edu.br
Contributions, corrections, and discussion all welcome.
@misc{mteb-portuguese-2026,
title = {MTEB Portuguese: A Massive Text Embedding Benchmark for Brazilian Portuguese},
author = {Stekel, Tardelli},
year = {2026},
url = {https://huggingface.co/spaces/mteb-pt/leaderboard}
}
Built on top of the mteb library (Muennighoff et al., 2023). The multilingual sub-benchmark methodology follows MMTEB (Enevoldsen et al., 2025). Task datasets contributed by their original authors β see the task suite for sources.