Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Qwen
/
Qwen-1_8B-Chat-Int8
like
4
Follow
Qwen
1,858
Text Generation
Transformers
Safetensors
Chinese
English
qwen
custom_code
8-bit precision
gptq
arxiv:
2309.16609
arxiv:
2305.08322
arxiv:
2009.03300
Model card
Files
Files and versions
Community
2
Train
Use this model
d53dc51
Qwen-1_8B-Chat-Int8
1 contributor
History:
7 commits
xingzhang
update modeling_qwen.py
d53dc51
11 months ago
assets
update
12 months ago
examples
update
12 months ago
.gitattributes
Safe
1.52 kB
initial commit
12 months ago
LICENSE
Safe
7.28 kB
update
12 months ago
NOTICE
Safe
15.3 kB
update
12 months ago
README.md
Safe
24.2 kB
update readme
12 months ago
cache_autogptq_cuda_256.cpp
Safe
8.4 kB
init model
12 months ago
cache_autogptq_cuda_kernel_256.cu
Safe
52 kB
init model
12 months ago
config.json
Safe
1.2 kB
update
12 months ago
configuration_qwen.py
Safe
2.35 kB
init model
12 months ago
cpp_kernels.py
Safe
1.92 kB
init model
12 months ago
generation_config.json
Safe
249 Bytes
init model
12 months ago
model.safetensors
Safe
2.49 GB
LFS
init model
12 months ago
modeling_qwen.py
Safe
55.6 kB
update modeling_qwen.py
11 months ago
quantize_config.json
Safe
214 Bytes
update
12 months ago
qwen.tiktoken
Safe
2.56 MB
init model
12 months ago
qwen_generation_utils.py
Safe
14.6 kB
init model
12 months ago
tokenization_qwen.py
Safe
9.62 kB
init model
12 months ago
tokenizer_config.json
Safe
173 Bytes
init model
12 months ago