mwitiderrick
commited on
Commit
·
bced865
1
Parent(s):
e788988
Update README.md
Browse files
README.md
CHANGED
@@ -108,4 +108,8 @@ python sparseml/src/sparseml/transformers/sparsification/obcq/export.py --task t
|
|
108 |
cp deployment/model.onnx deployment/model-orig.onnx
|
109 |
wget https://huggingface.co/nm-testing/TinyLlama-1.1B-Chat-v0.4-pruned50-quant/raw/main/onnx_kv_inject.py # kv_cache file
|
110 |
python onnx_kv_inject.py --input-file deployment/model-orig.onnx --output-file deployment/model.onnx
|
111 |
-
```
|
|
|
|
|
|
|
|
|
|
108 |
cp deployment/model.onnx deployment/model-orig.onnx
|
109 |
wget https://huggingface.co/nm-testing/TinyLlama-1.1B-Chat-v0.4-pruned50-quant/raw/main/onnx_kv_inject.py # kv_cache file
|
110 |
python onnx_kv_inject.py --input-file deployment/model-orig.onnx --output-file deployment/model.onnx
|
111 |
+
```
|
112 |
+
|
113 |
+
## Slack
|
114 |
+
|
115 |
+
For further support, and discussions on these models and AI in general, join us at [Neural Magic's Slack server](https://join.slack.com/t/discuss-neuralmagic/shared_invite/zt-q1a1cnvo-YBoICSIw3L1dmQpjBeDurQ)
|