Update README.md
Browse files
README.md
CHANGED
|
@@ -4,9 +4,9 @@ inference: false
|
|
| 4 |
tags: [green, llmware-rag, p1, ov]
|
| 5 |
---
|
| 6 |
|
| 7 |
-
# bling-tiny-llama-
|
| 8 |
|
| 9 |
-
**bling-tiny-llama-
|
| 10 |
|
| 11 |
This model is one of the smallest and fastest in the series. For higher accuracy, look at larger models in the BLING/DRAGON series.
|
| 12 |
|
|
@@ -23,7 +23,7 @@ This model is one of the smallest and fastest in the series. For higher accurac
|
|
| 23 |
- **RAG Benchmark Accuracy Score:** 86.5
|
| 24 |
|
| 25 |
|
| 26 |
-
Get started right away with [
|
| 27 |
|
| 28 |
Looking for AI PC solutions, contact us at [llmware](https://www.llmware.ai)
|
| 29 |
|
|
|
|
| 4 |
tags: [green, llmware-rag, p1, ov]
|
| 5 |
---
|
| 6 |
|
| 7 |
+
# bling-tiny-llama-onnx
|
| 8 |
|
| 9 |
+
**bling-tiny-llama-onnx** is a very small, very fast fact-based question-answering model, designed for retrieval augmented generation (RAG) with complex business documents, and quantized and packaged in ONNX int4 for AI PCs using Intel GPU, CPU and NPU.
|
| 10 |
|
| 11 |
This model is one of the smallest and fastest in the series. For higher accuracy, look at larger models in the BLING/DRAGON series.
|
| 12 |
|
|
|
|
| 23 |
- **RAG Benchmark Accuracy Score:** 86.5
|
| 24 |
|
| 25 |
|
| 26 |
+
Get started right away with [ONNX Runtime](https://github.com/microsoft/onnxruntime)
|
| 27 |
|
| 28 |
Looking for AI PC solutions, contact us at [llmware](https://www.llmware.ai)
|
| 29 |
|