ggml files of bge-base-en
You can use this ggml for https://github.com/skeskinen/bert.cpp
bge-base-en
| Data Type | STSBenchmark | eval time | EmotionClassification | eval time |
|---|---|---|---|---|
| f32 | 0.8630 | 39.56 | 0.5533 | 69.55 |
| f16 | 0.8630 | 32.95 | 0.5533 | 55.75 |
| q4_0 | 0.8627 | 27.23 | 0.5540 | 73.29 |
| q4_1 | 0.8654 | 29.78 | 0.5508 | 69.81 |
all-MiniLM-L12-v2
| Data Type | STSBenchmark | eval time | EmotionClassification | eval time |
|---|---|---|---|---|
| f32 | 0.8306 | 13.36 | 0.4117 | 21.23 |
| f16 | 0.8306 | 11.51 | 0.4119 | 20.08 |
| q4_0 | 0.8310 | 11.27 | 0.4183 | 20.81 |
| q4_1 | 0.8325 | 12.37 | 0.4093 | 19.38 |
all-MiniLM-L6-v2
| Data Type | STSBenchmark | eval time | EmotionClassification | eval time |
|---|---|---|---|---|
| f32 | 0.8201 | 6.83 | 0.4082 | 11.34 |
| f16 | 0.8201 | 6.17 | 0.4085 | 10.28 |
| q4_0 | 0.8175 | 5.45 | 0.3911 | 10.63 |
| q4_1 | 0.8223 | 6.79 | 0.4027 | 11.41 |
bert-base-uncased
| Data Type | STSBenchmark | eval time | EmotionClassification | eval time |
|---|---|---|---|---|
| f32 | 0.4738 | 52.38 | 0.3361 | 88.56 |
| f16 | 0.4739 | 33.24 | 0.3361 | 55.86 |
| q4_0 | 0.4940 | 33.93 | 0.3375 | 57.82 |
| q4_1 | 0.4612 | 36.86 | 0.3318 | 59.63 |
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support