Update README.md
README.md
@@ -8,3 +8,32 @@ base_model:
> zerank-1 is a state-of-the-art reranker developed by ZeroEntropy to significantly enhance retrieval accuracy in search engines. Unlike most SOTA rerankers that are closed-source and proprietary, zerank-1 outperforms top models like Cohere-Rerank-v3.5 and Salesforce/LlamaRank-v1 across diverse domains including finance, legal, code, STEM, medical, and conversational data.

## Model files

| File | Size | Format |
|------|------|--------|
| zerank-1.BF16.gguf | 8.05 GB | BF16 |
| zerank-1.F16.gguf | 8.05 GB | F16 |
| zerank-1.F32.gguf | 16.1 GB | F32 |
| zerank-1.Q2_K.gguf | 1.67 GB | Q2_K |
| zerank-1.Q3_K_L.gguf | 2.24 GB | Q3_K_L |
| zerank-1.Q3_K_M.gguf | 2.08 GB | Q3_K_M |
| zerank-1.Q3_K_S.gguf | 1.89 GB | Q3_K_S |
| zerank-1.Q4_K_M.gguf | 2.5 GB | Q4_K_M |
| zerank-1.Q4_K_S.gguf | 2.38 GB | Q4_K_S |
| zerank-1.Q5_K_M.gguf | 2.89 GB | Q5_K_M |
| zerank-1.Q5_K_S.gguf | 2.82 GB | Q5_K_S |
| zerank-1.Q6_K.gguf | 3.31 GB | Q6_K |
| zerank-1.Q8_0.gguf | 4.28 GB | Q8_0 |
| .gitattributes | 2.39 kB | - |
| README.md | 462 Bytes | - |
| config.json | 29 Bytes | - |

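As a rough illustration of how the sizes in the "Model files" table can guide a choice, the sketch below picks the largest quant that fits a given size budget and estimates bits per weight from the file-size ratio to F32. The function names and the budget parameter are hypothetical, not part of any tool shipped with this repository. The estimate assumes the F32 file consists almost entirely of 32-bit weights, and the budget covers file size only, not the extra memory a runtime needs for context and buffers.

```python
# Sizes in GB, copied from the "Model files" table.
QUANT_SIZES_GB = {
    "BF16": 8.05,
    "F16": 8.05,
    "F32": 16.1,
    "Q2_K": 1.67,
    "Q3_K_L": 2.24,
    "Q3_K_M": 2.08,
    "Q3_K_S": 1.89,
    "Q4_K_M": 2.5,
    "Q4_K_S": 2.38,
    "Q5_K_M": 2.89,
    "Q5_K_S": 2.82,
    "Q6_K": 3.31,
    "Q8_0": 4.28,
}


def largest_quant_within(budget_gb, sizes=QUANT_SIZES_GB):
    """Return the file name of the largest quant that fits the budget, or None.

    Hypothetical helper: "fits" only compares file size against budget_gb.
    """
    fitting = {fmt: gb for fmt, gb in sizes.items() if gb <= budget_gb}
    if not fitting:
        return None
    best = max(fitting, key=fitting.get)
    return f"zerank-1.{best}.gguf"


def approx_bits_per_weight(fmt, sizes=QUANT_SIZES_GB):
    """Rough estimate: scale the 32 bits of F32 by the file-size ratio.

    Assumes the F32 file is dominated by 32-bit weights (metadata ignored).
    """
    return 32.0 * sizes[fmt] / sizes["F32"]


if __name__ == "__main__":
    print(largest_quant_within(3.0))
    print(round(approx_bits_per_weight("Q4_K_M"), 2))
```

Size is only a proxy for quality, so a result from this helper is a starting point for evaluation rather than a recommendation.
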
## Quants Usage

(The table above is sorted by file name, not by size or quality. IQ-quants are often preferable to similar-sized non-IQ quants.)

Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):

