Update app.py
Browse filesSigned-off-by: SimJeg <[email protected]>
app.py
CHANGED
@@ -63,6 +63,7 @@ def gradio_interface():
|
|
63 |
# Wikipedia Article Question Answering with kvpress
|
64 |
This demo uses the llama 3.1 8B Instruct model to answer questions about any given Wikipedia article.
|
65 |
Under the hood, [kvpress](https://github.com/NVIDIA/kvpress) *compresses the key-value (KV) cache* associated with the article, helping reduce memory usage and accelerate decoding.
|
|
|
66 |
**How to use:**
|
67 |
1. Enter a Wikipedia article URL
|
68 |
2. Type your question
|
|
|
63 |
# Wikipedia Article Question Answering with kvpress
|
64 |
This demo uses the llama 3.1 8B Instruct model to answer questions about any given Wikipedia article.
|
65 |
Under the hood, [kvpress](https://github.com/NVIDIA/kvpress) *compresses the key-value (KV) cache* associated with the article, helping reduce memory usage and accelerate decoding.
|
66 |
+
|
67 |
**How to use:**
|
68 |
1. Enter a Wikipedia article URL
|
69 |
2. Type your question
|