Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
compressa-ai
/
Saiga-Llama-3-8B-OmniQuant
like
0
Follow
Compressa
8
Text Generation
Transformers
Safetensors
Russian
llama
saiga
llama3
omniquant
gptq
triton
conversational
text-generation-inference
Inference Endpoints
4-bit precision
License:
llama3
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
89f2cc0
Saiga-Llama-3-8B-OmniQuant
2 contributors
History:
7 commits
Vasily Alexeev
fix titles style for summary
89f2cc0
10 months ago
.gitattributes
1.52 kB
initial commit
10 months ago
README.md
6.8 kB
fix titles style for summary
10 months ago
compressa-config.json
663 Bytes
add weights and stuff
10 months ago
config.json
885 Bytes
add weights and stuff
10 months ago
generation_config.json
277 Bytes
add weights and stuff
10 months ago
model-00001-of-00002.safetensors
4.68 GB
LFS
add weights and stuff
10 months ago
model-00002-of-00002.safetensors
1.05 GB
LFS
add weights and stuff
10 months ago
model.safetensors.index.json
78.5 kB
add weights and stuff
10 months ago
quant_config.json
63 Bytes
add weights and stuff
10 months ago
special_tokens_map.json
563 Bytes
add weights and stuff
10 months ago
tokenizer.json
9.08 MB
add weights and stuff
10 months ago
tokenizer_config.json
51.3 kB
add weights and stuff
10 months ago