MTEB‑PT

A Text Embedding Benchmark for Brazilian Portuguese

Native, not translated. 54 embedding models ranked on 16 Brazilian-Portuguese tasks, built only from text written in Portuguese (no machine-translated benchmarks), with confidence intervals, significance tests, and an analysis of which tasks actually separate models.

54 models
16 native tasks
6 categories
46 open + 8 closed models

Live

The leaderboard

54 models on native Brazilian-Portuguese tasks, ranked by the 16-task mean. The top 15 are shown below; the full interactive table, the IRT ranking, and per-category views live on Hugging Face.

Figure: Open-weight models, quality versus size (parameter count on a log scale). The dashed line is the Pareto frontier: the best 16-task mean reachable at each parameter budget.

#   Model                            Access  Params  License       Mean (16 tasks)
1   gemini-embedding-001             CLOSED  -       proprietary   0.744
2   text-embedding-3-large           CLOSED  -       proprietary   0.733
3   Qwen3-Embedding-8B               OPEN    8B      Apache-2.0    0.733
4   gemini-embedding-2               CLOSED  -       proprietary   0.731
5   Octen-Embedding-8B               OPEN    8B      Apache-2.0    0.728
6   embeddinggemma-300m              OPEN    300M    Gemma         0.726
7   voyage-4-large                   CLOSED  -       proprietary   0.724
8   harrier-oss-v1-27b               OPEN    27B     MIT           0.722
9   embed-v4                         CLOSED  -       proprietary   0.722
10  Qwen3-Embedding-4B               OPEN    4B      Apache-2.0    0.718
11  KaLM-Embedding-Gemma3-12B-2511   OPEN    11.8B   Tencent-KaLM  0.711
12  F2LLM-v2-8B                      OPEN    8B      Apache-2.0    0.711
13  text-embedding-3-small           CLOSED  -       proprietary   0.710
14  harrier-oss-v1-0.6b              OPEN    0.6B    MIT           0.699
15  F2LLM-v2-14B                     OPEN    14B     Apache-2.0    0.691
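
To make the Pareto frontier from the size-versus-quality chart concrete, here is a minimal sketch that recomputes it from the open-weight rows of the top-15 table only. The `Model` tuple and the `pareto_frontier` helper are hypothetical names introduced for illustration, not part of the benchmark's scripts, and the real frontier over all 46 open models may contain additional points (for example, models smaller than 300M that fall outside the top 15).

```python
from typing import NamedTuple


class Model(NamedTuple):
    name: str
    params_b: float  # parameter count in billions
    mean16: float    # 16-task mean score


# Open-weight rows copied from the top-15 table (a subset, not all 46 open models).
OPEN_TOP15 = [
    Model("Qwen3-Embedding-8B", 8.0, 0.733),
    Model("Octen-Embedding-8B", 8.0, 0.728),
    Model("embeddinggemma-300m", 0.3, 0.726),
    Model("harrier-oss-v1-27b", 27.0, 0.722),
    Model("Qwen3-Embedding-4B", 4.0, 0.718),
    Model("KaLM-Embedding-Gemma3-12B-2511", 11.8, 0.711),
    Model("F2LLM-v2-8B", 8.0, 0.711),
    Model("harrier-oss-v1-0.6b", 0.6, 0.699),
    Model("F2LLM-v2-14B", 14.0, 0.691),
]


def pareto_frontier(models):
    """Keep each model that scores higher than every model of equal or smaller size."""
    # Sort by size ascending; among equal sizes, put the best score first.
    ordered = sorted(models, key=lambda m: (m.params_b, -m.mean16))
    frontier = []
    best = float("-inf")
    for m in ordered:
        # A model joins the frontier only if it beats the best score seen so far.
        if m.mean16 > best:
            frontier.append(m)
            best = m.mean16
    return frontier


if __name__ == "__main__":
    for m in pareto_frontier(OPEN_TOP15):
        print(f"{m.params_b:>5.1f}B  {m.mean16:.3f}  {m.name}")
```

On this subset the frontier has two points, embeddinggemma-300m and Qwen3-Embedding-8B: every other open model in the top 15 is matched or beaten by a model of equal or smaller size.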

Cite

The paper

A full write-up (benchmark design, the statistical layer, IRT task discrimination, and a cross-leaderboard validity analysis) is in preparation and will be posted as an arXiv preprint (cs.CL). A citation will appear here when it is live.

Contribute

Submit a model

Want your embedding model on the leaderboard? We accept submissions through either of the two channels below; pick whichever fits. Every score is reproducible from public scripts, so each new row can be audited.

Hugging Face discussion

Share the model ID and any prompt or pooling details. Best for a quick request.

Open a discussion ↗

GitHub issue

Prefer the code side? File an issue or a pull request on the benchmark repository.

Open an issue ↗