Update README.md
README.md
CHANGED
@@ -6,4 +6,5 @@ language:
 - en
 pipeline_tag: text-generation
 ---
-
+### Outlook
+We have quantised the model to 8-bit so that inference can run on low-end GPUs at scale. This was done with the llama.cpp library.
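To make the idea of 8-bit quantisation concrete, here is a minimal Python sketch of blockwise quantisation: each block of weights is stored as signed 8-bit integers plus one floating-point scale. This is an illustration only and not llama.cpp's actual code; the block size of 32 and the function names (`quantize`, `dequantize`, and so on) are assumptions made for the example.

```python
from typing import List, Tuple

BLOCK_SIZE = 32  # assumed block size, chosen for illustration

Block = Tuple[float, List[int]]  # (scale, 8-bit integer values)


def quantize_block(values: List[float]) -> Block:
    """Map one block of floats to integers in [-127, 127] plus one scale."""
    amax = max((abs(v) for v in values), default=0.0)
    scale = amax / 127.0
    if scale == 0.0:
        # All-zero (or empty) block: nothing to scale.
        return 0.0, [0] * len(values)
    quants = [max(-127, min(127, round(v / scale))) for v in values]
    return scale, quants


def dequantize_block(scale: float, quants: List[int]) -> List[float]:
    """Recover approximate floats from one quantised block."""
    return [scale * q for q in quants]


def quantize(weights: List[float]) -> List[Block]:
    """Split the weights into blocks and quantise each block independently."""
    return [
        quantize_block(weights[i:i + BLOCK_SIZE])
        for i in range(0, len(weights), BLOCK_SIZE)
    ]


def dequantize(blocks: List[Block]) -> List[float]:
    """Concatenate the dequantised blocks back into one flat list."""
    restored: List[float] = []
    for scale, quants in blocks:
        restored.extend(dequantize_block(scale, quants))
    return restored


if __name__ == "__main__":
    weights = [i / 10 - 3.2 for i in range(64)]
    blocks = quantize(weights)
    restored = dequantize(blocks)
    worst = max(abs(a - b) for a, b in zip(weights, restored))
    print(f"{len(blocks)} blocks, worst absolute error {worst:.4f}")
```

In a scheme like this, each weight costs one byte plus a small per-block overhead for the scale, which is where the memory saving over 16-bit or 32-bit floats comes from. The rounding error per weight is bounded by half the block's scale.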