Spaces:
Running
on
Zero
Running
on
Zero
Update app.py
Browse filesSigned-off-by: SimJeg <sjegou@nvidia.com>
app.py
CHANGED
|
@@ -63,6 +63,7 @@ def gradio_interface():
|
|
| 63 |
# Wikipedia Article Question Answering with kvpress
|
| 64 |
This demo uses the llama 3.1 8B Instruct model to answer questions about any given Wikipedia article.
|
| 65 |
Under the hood, [kvpress](https://github.com/NVIDIA/kvpress) *compresses the key-value (KV) cache* associated with the article, helping reduce memory usage and accelerate decoding.
|
|
|
|
| 66 |
**How to use:**
|
| 67 |
1. Enter a Wikipedia article URL
|
| 68 |
2. Type your question
|
|
|
|
| 63 |
# Wikipedia Article Question Answering with kvpress
|
| 64 |
This demo uses the llama 3.1 8B Instruct model to answer questions about any given Wikipedia article.
|
| 65 |
Under the hood, [kvpress](https://github.com/NVIDIA/kvpress) *compresses the key-value (KV) cache* associated with the article, helping reduce memory usage and accelerate decoding.
|
| 66 |
+
|
| 67 |
**How to use:**
|
| 68 |
1. Enter a Wikipedia article URL
|
| 69 |
2. Type your question
|