metadata
license: apache-2.0
embeddings
This is a repo created to keep a collection of quantized bert models in ggml format.
usage
You can utilize bert.cpp as usual or use our new api to quickly prototype on real use-case scenarios for text similarity.
| api | mb |
|---|---|
| nano | 11.2 |
| small | 14.5 |
| medium | 21.3 |
| large | 68.8 |
We are planning to update the list to always support lastest open-source models.
embeddings sample
./main -m small -p "word"
[0.0698, -0.0024, -0.0153, 0.0193, -0.1060, -0.0278, 0.1424, -0.0056, -0.0536...
api sample
{ ...,
"similarity": {
"semantic": 0.035270897102758283,
"literal": 0.22219999999999998
}
}
