Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

study-hjt
/

Qwen1.5-32B-Chat-GPTQ-Int8

Text Generation

text-generation-inference

Inference Endpoints

8-bit precision

Model card Files Files and versions Community

Qwen1.5-32B-Chat-GPTQ-Int8

2 contributors

History: 2 commits

Jintao Huang

first commit

7710fe0 10 months ago

.gitattributes

1.52 kB

initial commit 10 months ago
LICENSE

6.9 kB

first commit 10 months ago
README.md

4.9 kB

first commit 10 months ago
added_tokens.json

80 Bytes

first commit 10 months ago
config.json

988 Bytes

first commit 10 months ago
configuration.json

87 Bytes

first commit 10 months ago
generation_config.json

243 Bytes

first commit 10 months ago
merges.txt

1.67 MB

first commit 10 months ago
model-00001-of-00004.safetensors

9.98 GB
LFS

first commit 10 months ago
model-00002-of-00004.safetensors

9.97 GB
LFS

first commit 10 months ago
model-00003-of-00004.safetensors

9.91 GB
LFS

first commit 10 months ago
model-00004-of-00004.safetensors

4.96 GB
LFS

first commit 10 months ago
model.safetensors.index.json

192 kB

first commit 10 months ago
quantize_config.json

243 Bytes

first commit 10 months ago
special_tokens_map.json

253 Bytes

first commit 10 months ago
tokenizer.json

7.03 MB

first commit 10 months ago
tokenizer_config.json

1.3 kB

first commit 10 months ago
vocab.json

2.78 MB

first commit 10 months ago