Update README.md
Browse files
README.md
CHANGED
|
@@ -7,13 +7,14 @@ license: cc-by-sa-4.0
|
|
| 7 |
<!-- Provide a quick summary of what the model is/does. -->
|
| 8 |
|
| 9 |
|
| 10 |
-
**slim-summary-tool** is a 4_K_M quantized GGUF version of slim-summary, providing a small, fast inference implementation, optimized for multi-model concurrent deployment.
|
| 11 |
|
| 12 |
-
The size of the self-contained GGUF model binary is 1.71 GB, which is small enough to run locally on a CPU with reasonable inference speed.
|
| 13 |
|
| 14 |
The model takes as input a text passage, an optional parameter with a focusing phrase or query, and an experimental optional (N) parameter, which is used to guide the model to a specific number of items return in a summary list.
|
| 15 |
|
| 16 |
-
[**slim-summary**](https://huggingface.co/llmware/slim-summary)
|
|
|
|
| 17 |
|
| 18 |
To pull the model via API:
|
| 19 |
|
|
|
|
| 7 |
<!-- Provide a quick summary of what the model is/does. -->
|
| 8 |
|
| 9 |
|
| 10 |
+
**slim-summary-tool** is a 4_K_M quantized GGUF version of slim-summary, providing a small, fast inference implementation, optimized for multi-model concurrent deployment, to provide high-quality summarizations of complex business documents, on a small, specialized locally-deployable model.
|
| 11 |
|
| 12 |
+
The size of the self-contained GGUF model binary is 1.71 GB, which is small enough to run locally on a CPU with reasonable inference speed, and has been optimized to maximize high-quality with the ability to deploy on a local machine.
|
| 13 |
|
| 14 |
The model takes as input a text passage, an optional parameter with a focusing phrase or query, and an experimental optional (N) parameter, which is used to guide the model to a specific number of items return in a summary list.
|
| 15 |
|
| 16 |
+
Please see the usage notes at: [**slim-summary**](https://huggingface.co/llmware/slim-summary)
|
| 17 |
+
|
| 18 |
|
| 19 |
To pull the model via API:
|
| 20 |
|