Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
hongyin
/
chat-llama2-70b-4bit
like
0
Text Generation
Transformers
PyTorch
English
Chinese
llama
text-generation-inference
Inference Endpoints
4-bit precision
gptq
arxiv:
2302.13173
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
24d0fcd
chat-llama2-70b-4bit
1 contributor
History:
10 commits
hongyin
Upload pytorch_model.bin.index.json
24d0fcd
12 months ago
.gitattributes
1.52 kB
initial commit
12 months ago
README.md
969 Bytes
Update README.md
12 months ago
added_tokens.json
42 Bytes
Upload 2 files
12 months ago
config.json
1.21 kB
Upload 2 files
12 months ago
pytorch_model.bin.index.json
241 kB
Upload pytorch_model.bin.index.json
12 months ago