abhinavkulkarni
commited on
Commit
•
1b0c64f
1
Parent(s):
bb97ddf
Update README.md
Browse files
README.md
CHANGED
@@ -31,7 +31,9 @@ This model was successfully tested on CUDA driver v12.1 and toolkit v11.7 with P
|
|
31 |
git clone https://github.com/mit-han-lab/llm-awq \
|
32 |
&& cd llm-awq \
|
33 |
&& git checkout 71d8e68df78de6c0c817b029a568c064bf22132d \
|
34 |
-
&& pip install -e .
|
|
|
|
|
35 |
```
|
36 |
|
37 |
```python
|
@@ -80,7 +82,7 @@ output = model.generate(
|
|
80 |
repetition_penalty=1.1,
|
81 |
eos_token_id=tokenizer.eos_token_id
|
82 |
)
|
83 |
-
print(tokenizer.decode(output[0]))
|
84 |
```
|
85 |
|
86 |
## Evaluation
|
|
|
31 |
git clone https://github.com/mit-han-lab/llm-awq \
|
32 |
&& cd llm-awq \
|
33 |
&& git checkout 71d8e68df78de6c0c817b029a568c064bf22132d \
|
34 |
+
&& pip install -e . \
|
35 |
+
&& cd awq/kernels \\
|
36 |
+
&& python setup.py install
|
37 |
```
|
38 |
|
39 |
```python
|
|
|
82 |
repetition_penalty=1.1,
|
83 |
eos_token_id=tokenizer.eos_token_id
|
84 |
)
|
85 |
+
print(tokenizer.decode(output[0], skip_special_tokens=True))
|
86 |
```
|
87 |
|
88 |
## Evaluation
|