Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
study-hjt
/
Qwen1.5-32B-Chat-GPTQ-Int8
like
1
Text Generation
Transformers
Safetensors
English
Chinese
qwen2
qwen
32b
gptq
int8
chat
conversational
text-generation-inference
Inference Endpoints
8-bit precision
arxiv:
2309.16609
License:
tongyi-qianwen
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
7710fe0
Qwen1.5-32B-Chat-GPTQ-Int8
2 contributors
History:
2 commits
Jintao Huang
first commit
7710fe0
10 months ago
.gitattributes
1.52 kB
initial commit
10 months ago
LICENSE
6.9 kB
first commit
10 months ago
README.md
4.9 kB
first commit
10 months ago
added_tokens.json
80 Bytes
first commit
10 months ago
config.json
988 Bytes
first commit
10 months ago
configuration.json
87 Bytes
first commit
10 months ago
generation_config.json
243 Bytes
first commit
10 months ago
merges.txt
1.67 MB
first commit
10 months ago
model-00001-of-00004.safetensors
9.98 GB
LFS
first commit
10 months ago
model-00002-of-00004.safetensors
9.97 GB
LFS
first commit
10 months ago
model-00003-of-00004.safetensors
9.91 GB
LFS
first commit
10 months ago
model-00004-of-00004.safetensors
4.96 GB
LFS
first commit
10 months ago
model.safetensors.index.json
192 kB
first commit
10 months ago
quantize_config.json
243 Bytes
first commit
10 months ago
special_tokens_map.json
253 Bytes
first commit
10 months ago
tokenizer.json
7.03 MB
first commit
10 months ago
tokenizer_config.json
1.3 kB
first commit
10 months ago
vocab.json
2.78 MB
first commit
10 months ago