Gemma Embeddings v0.8
GemmaEmbed is a dense-vector embedding model, trained especially for retrieval. As of December 2, 2024, GemmaEmbed achieves the #1 position overall on the MTEB Retrieval leaderboard, with a score of 63.80.
Important Notes
- This is not an official Google product.
- This is a research project.
Results summary
Results compared to BGE-EN-ICL on several large datasets
| Model | DBPedia | FEVER | HotPotQA | MSMARCO | NQ |
|---|---|---|---|---|---|
| BGE-EN-ICL | 51.63 | 92.83 | 85.14 | 46.79 | 73.88 |
| Gemma-Embeddings-v0.8 | 52.58 | 93.50 | 87.58 | 47.13 | 74.45 |
Model & Data
Our base encoder model is Gemma2 9B.
We use the BGE-EN-ICL training data.
Research Team
- Nicholas Monath
- Michael Boratko
- Seungyeon Kim
- Andrew McCallum
- Rob Fergus
- Manzil Zaheer
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for pipawo1881/Gemma-Embeddings-v0.8
Evaluation results
- ndcg_at_1 on MTEB ArguAna (default)test set self-reported68.350
- ndcg_at_3 on MTEB ArguAna (default)test set self-reported80.894
- ndcg_at_5 on MTEB ArguAna (default)test set self-reported82.664
- ndcg_at_10 on MTEB ArguAna (default)test set self-reported83.828
- ndcg_at_20 on MTEB ArguAna (default)test set self-reported84.084
- ndcg_at_100 on MTEB ArguAna (default)test set self-reported84.280
- ndcg_at_1000 on MTEB ArguAna (default)test set self-reported84.280
- map_at_1 on MTEB ArguAna (default)test set self-reported68.350
- map_at_3 on MTEB ArguAna (default)test set self-reported77.786
- map_at_5 on MTEB ArguAna (default)test set self-reported78.774
- map_at_10 on MTEB ArguAna (default)test set self-reported79.276
- map_at_20 on MTEB ArguAna (default)test set self-reported79.349
- map_at_100 on MTEB ArguAna (default)test set self-reported79.380
- map_at_1000 on MTEB ArguAna (default)test set self-reported79.380
- recall_at_1 on MTEB ArguAna (default)test set self-reported68.350
- recall_at_3 on MTEB ArguAna (default)test set self-reported89.900
- recall_at_5 on MTEB ArguAna (default)test set self-reported94.168
- recall_at_10 on MTEB ArguAna (default)test set self-reported97.653
- recall_at_20 on MTEB ArguAna (default)test set self-reported98.649
- recall_at_100 on MTEB ArguAna (default)test set self-reported99.644