seonglae/opt-125m-4bit-gptq
Tags: Text Generation, Transformers, opt, auto-gptq, gptq, 4bit, Inference Endpoints
License: mit
Revision: 2019cce
1 contributor
History: 5 commits
Latest commit: seonglae, "build: AutoGPTQ for facebook/opt-125m: 4bits, gr128, desc_act=False" (2019cce, over 1 year ago)
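The commit summary names the quantization settings as "4bits, gr128, desc_act=False". As a rough illustration of what 4-bit storage with a group size of 128 means, the sketch below applies plain round-to-nearest quantization with one scale and offset per group. It assumes that "gr128" denotes a group size of 128, and it is not the GPTQ algorithm itself, which additionally compensates for rounding error. The function names are hypothetical.

```python
def quantize_groupwise(weights, bits=4, group_size=128):
    """Round-to-nearest quantization with one (scale, offset) pair per group.

    Illustrative only: GPTQ chooses the rounded values more carefully.
    """
    levels = (1 << bits) - 1  # 15 distinct steps for 4 bits
    codes, params = [], []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        lo, hi = min(group), max(group)
        # Fall back to 1.0 so a constant group does not divide by zero.
        scale = (hi - lo) / levels or 1.0
        params.append((scale, lo))
        codes.extend(round((w - lo) / scale) for w in group)
    return codes, params


def dequantize_groupwise(codes, params, group_size=128):
    """Map integer codes back to approximate floating-point weights."""
    restored = []
    for i, code in enumerate(codes):
        scale, lo = params[i // group_size]
        restored.append(lo + code * scale)
    return restored


if __name__ == "__main__":
    weights = [i / 255 for i in range(256)]  # two groups of 128 values
    codes, params = quantize_groupwise(weights)
    restored = dequantize_groupwise(codes, params)
    print(len(params), max(abs(a - b) for a, b in zip(weights, restored)))
```

Each weight is stored as an integer between 0 and 15, and only the per-group scale and offset are kept in higher precision, which is where the size reduction comes from.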
| File | Size | Storage | Last commit | Updated |
| --- | --- | --- | --- | --- |
| .gitattributes | 1.52 kB | | initial commit | over 1 year ago |
| README.md | 159 Bytes | | docs: notify inference from hugginf face workability | over 1 year ago |
| config.json | 747 Bytes | | AutoGPTQ model for facebook/opt-125m: 4bits, gr128, desc_act=False | over 1 year ago |
| gptq_model-4bit-128g.bin | 125 MB | LFS | AutoGPTQ model for facebook/opt-125m: 4bits, gr128, desc_act=False | over 1 year ago |
| gptq_model-4bit-128g.safetensors | 202 MB | LFS | AutoGPTQ model for facebook/opt-125m: 4bits, gr128, desc_act=False | over 1 year ago |
| merges.txt | 456 kB | | build: AutoGPTQ for facebook/opt-125m: 4bits, gr128, desc_act=False | over 1 year ago |
| quantize_config.json | 219 Bytes | | AutoGPTQ model for facebook/opt-125m: 4bits, gr128, desc_act=False | over 1 year ago |
| special_tokens_map.json | 548 Bytes | | build: AutoGPTQ for facebook/opt-125m: 4bits, gr128, desc_act=False | over 1 year ago |
| tokenizer.json | 2.11 MB | | build: AutoGPTQ for facebook/opt-125m: 4bits, gr128, desc_act=False | over 1 year ago |
| tokenizer_config.json | 870 Bytes | | build: AutoGPTQ for facebook/opt-125m: 4bits, gr128, desc_act=False | over 1 year ago |
| vocab.json | 798 kB | | build: AutoGPTQ for facebook/opt-125m: 4bits, gr128, desc_act=False | over 1 year ago |

gptq_model-4bit-128g.bin is a pickle file. Detected Pickle imports (4): "torch.IntStorage", "collections.OrderedDict", "torch._utils._rebuild_tensor_v2", "torch.HalfStorage"
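The "Detected Pickle imports" entry for gptq_model-4bit-128g.bin lists the globals that the pickle stream would import when loaded. A minimal sketch of how such a list can be produced without unpickling anything, using only the standard library's `pickletools`, is shown below. The helper name `list_pickle_imports` is hypothetical, and this is not necessarily how the Hub's scanner is implemented. The handling of `STACK_GLOBAL` is a simple heuristic that assumes the module and attribute names were the two most recently pushed strings.

```python
import collections
import io
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> set:
    """Return the "module.name" globals referenced by a pickle stream.

    The stream is only disassembled, never executed.
    """
    imports = set()
    recent_strings = []  # string arguments seen so far, for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(io.BytesIO(data)):
        if opcode.name == "GLOBAL":
            # pickletools reports the argument as "module name".
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Heuristic: module and name are the last two strings pushed.
            if len(recent_strings) >= 2:
                imports.add(f"{recent_strings[-2]}.{recent_strings[-1]}")
        elif isinstance(arg, str):
            recent_strings.append(arg)
    return imports


if __name__ == "__main__":
    sample = pickle.dumps(
        collections.OrderedDict(a=1), protocol=2, fix_imports=False
    )
    print(sorted(list_pickle_imports(sample)))
```

The sketch operates on raw pickle bytes. A PyTorch checkpoint saved in the zip-based format would first need its embedded pickle member extracted, which is not shown here. The .safetensors file in the same listing stores tensors without pickle, so no import scan applies to it.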