Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
neuralmagic
/
TinyLlama-1.1B-Chat-v0.4-pruned50-quant-ds
like
0
Follow
Neural Magic
287
Text Generation
Transformers
ONNX
llama
deepsparse
arxiv:
2301.00774
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
TinyLlama-1.1B-Chat-v0.4-pruned50-quant-ds
/
onnx_kv_inject.py
Commit History
Create onnx_kv_inject.py
d9b2258
mwitiderrick
commited on
Nov 29, 2023