Update README.md
Browse files
README.md
CHANGED
|
@@ -1,12 +1,18 @@
|
|
| 1 |
---
|
| 2 |
-
title:
|
| 3 |
-
emoji:
|
| 4 |
-
colorFrom:
|
| 5 |
-
colorTo:
|
| 6 |
-
sdk:
|
| 7 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 8 |
---
|
| 9 |
|
|
|
|
|
|
|
| 10 |
# VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models
|
| 11 |
|
| 12 |
## TL;DR
|
|
|
|
| 1 |
---
|
| 2 |
+
title: VPTQ demo
|
| 3 |
+
emoji: 🚀
|
| 4 |
+
colorFrom: blue
|
| 5 |
+
colorTo: green
|
| 6 |
+
sdk: gradio
|
| 7 |
+
sdk_version: 4.36.1
|
| 8 |
+
app_file: app.py
|
| 9 |
+
pinned: true
|
| 10 |
+
license: mit
|
| 11 |
+
short_description: Vector Post-Training Quantization (VPTQ) Demo
|
| 12 |
---
|
| 13 |
|
| 14 |
+
An example chatbot using [VPTQ](https://github.com/microsoft/VPTQ), [huggingface community](https://huggingface.co/spaces/VPTQ-community/).
|
| 15 |
+
|
| 16 |
# VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models
|
| 17 |
|
| 18 |
## TL;DR
|