sleeping4cat commited on
Commit
1d4dda8
·
verified ·
1 Parent(s): ee15135

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -6,4 +6,5 @@ language:
6
  - en
7
  pipeline_tag: text-generation
8
  ---
9
- An 8bit quant version of Qwen2. Enjoy : )
 
 
6
  - en
7
  pipeline_tag: text-generation
8
  ---
9
+ ### Outlook
10
+ We have quantised the model in 8-bit to make it inferenceable in low-end GPU cards at scale. It was achieved thanks to llama.cpp library.