| --- |
| license: apache-2.0 |
| --- |
|  |
| # embeddings |
| This repo collects quantized BERT models in ggml format. |
|
|
| ### usage |
| Use the models with [bert.cpp](https://github.com/skeskinen/bert.cpp) as usual, or use [our new API](https://rapidapi.com/nohakcoffee/api/simple-similarity) to quickly prototype real text-similarity use cases. |
|
|
|
|
| api model | size (MB) | |
| | ---- | ---- | |
| | nano | 11.2 | |
| | small | 14.5 | |
| | medium | 21.3 | |
| | large | 68.8 | |
|
|
|
|
| We plan to keep this list updated so that it always supports the latest open-source models. |
|
|
| ### embeddings sample |
|
|
| ``` |
| ./main -m small -p "word" |
| [0.0698, -0.0024, -0.0153, 0.0193, -0.1060, -0.0278, 0.1424, -0.0056, -0.0536... |
| ``` |
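
Embeddings like the one above are typically compared with cosine similarity. The following is a minimal sketch in Python, not part of bert.cpp or the API. The helper names `parse_embedding` and `cosine_similarity` are hypothetical, and the parser assumes the full, untruncated output is a single bracketed, comma-separated list. Whether the API's `semantic` score is computed this way is not stated here, so treat this only as an illustration.

```python
import math


def parse_embedding(text):
    """Parse a bracketed, comma-separated vector (assumed output format)."""
    return [float(tok) for tok in text.strip().strip("[]").split(",") if tok.strip()]


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)
```

To compare two texts, run `./main` once per text, pass each printed vector through `parse_embedding`, and call `cosine_similarity` on the two results. Values close to 1 indicate vectors pointing in nearly the same direction.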
|
|
| ### api sample |
|
|
| ``` |
| |
| { ..., |
| "similarity": { |
| "semantic": 0.035270897102758283, |
| "literal": 0.22219999999999998 |
| } |
| } |
| ``` |
|
|