Sentence Similarity
ONNX
Safetensors
English
ogma
embeddings
dense-retrieval
matryoshka
rag
agents
mteb
semantic-search
text-embeddings
text-embedding
vector-search
document-retrieval
similarity-search
classification
clustering
edge-ai
on-device
local-inference
efficient-ai
rag-retrieval
custom_code
Eval Results (legacy)
Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -100,7 +100,7 @@ pipeline_tag: sentence-similarity
|
|
| 100 |
|
| 101 |
> Small English text embedding model for semantic search, RAG, vector search, clustering, classification, and agent memory — MTEB 53.06, 3.5M parameters, 1024-token context
|
| 102 |
|
| 103 |
-
**Ogma Mini** is built for edge and resource-constrained deployment. At 3.5M parameters and 14 MB it scores **53.06 MTEB** in our
|
| 104 |
|
| 105 |
## Why the name Ogma?
|
| 106 |
|
|
@@ -136,9 +136,9 @@ Choose ogma-mini when you want a stronger model than micro while staying tiny en
|
|
| 136 |
|
| 137 |
## Performance
|
| 138 |
|
| 139 |
-
### MTEB English —
|
| 140 |
|
| 141 |
-
Benchmarked with [MTEB](https://github.com/embeddings-benchmark/mteb) v2.10.7 on the standard
|
| 142 |
|
| 143 |
| Category | ogma-mini | all-MiniLM-L6-v2 | Δ vs MiniLM |
|
| 144 |
|---|---|---|---|
|
|
@@ -333,7 +333,7 @@ for d in model.config.matryoshka_dims:
|
|
| 333 |
| *potion-base-32M* | 32.0M | 123 MB | *51.22* | 66.0 | 39.2 | 78.2 | 50.9 | 32.2 | 73.9 | 29.8 | 256 | inf |
|
| 334 |
| *potion-base-8M* | 7.6M | 29 MB | *50.03* | 64.44 | 32.93 | 76.62 | 49.73 | 31.71 | 73.24 | 29.28 | 256 | inf |
|
| 335 |
|
| 336 |
-
All Ogma: MTEB 2.10.7,
|
| 337 |
MiniLM/Potion: published scores from the [Model2Vec results page](https://github.com/MinishLab/model2vec/blob/main/results/README.md).
|
| 338 |
|
| 339 |
---
|
|
|
|
| 100 |
|
| 101 |
> Small English text embedding model for semantic search, RAG, vector search, clustering, classification, and agent memory — MTEB 53.06, 3.5M parameters, 1024-token context
|
| 102 |
|
| 103 |
+
**Ogma Mini** is built for edge and resource-constrained deployment. At 3.5M parameters and 14 MB it scores **53.06 MTEB** in our 66-task run while fitting in a fraction of the memory of 32M-parameter baselines. Ideal for mobile, IoT, browser, and serverless embedding workloads.
|
| 104 |
|
| 105 |
## Why the name Ogma?
|
| 106 |
|
|
|
|
| 136 |
|
| 137 |
## Performance
|
| 138 |
|
| 139 |
+
### MTEB English — 66/66 tasks (category-averaged)
|
| 140 |
|
| 141 |
+
Benchmarked with [MTEB](https://github.com/embeddings-benchmark/mteb) v2.10.7 on the standard 66-task English benchmark using category averaging (same methodology as the MTEB leaderboard).
|
| 142 |
|
| 143 |
| Category | ogma-mini | all-MiniLM-L6-v2 | Δ vs MiniLM |
|
| 144 |
|---|---|---|---|
|
|
|
|
| 333 |
| *potion-base-32M* | 32.0M | 123 MB | *51.22* | 66.0 | 39.2 | 78.2 | 50.9 | 32.2 | 73.9 | 29.8 | 256 | inf |
|
| 334 |
| *potion-base-8M* | 7.6M | 29 MB | *50.03* | 64.44 | 32.93 | 76.62 | 49.73 | 31.71 | 73.24 | 29.28 | 256 | inf |
|
| 335 |
|
| 336 |
+
All Ogma: MTEB 2.10.7, 66-task standard English set, category-averaged.
|
| 337 |
MiniLM/Potion: published scores from the [Model2Vec results page](https://github.com/MinishLab/model2vec/blob/main/results/README.md).
|
| 338 |
|
| 339 |
---
|