Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
JamAndTeaStudios
/
DeepSeek-R1-Distill-Qwen-1.5B-FP8-Dynamic
like
0
Follow
Jam and Tea Studios
5
Text Generation
Transformers
Safetensors
English
qwen2
chat
conversational
text-generation-inference
Inference Endpoints
compressed-tensors
License:
mit
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
6c8d348
DeepSeek-R1-Distill-Qwen-1.5B-FP8-Dynamic
1 contributor
History:
3 commits
yxue-jamandtea
Update README.md
6c8d348
verified
15 days ago
.gitattributes
1.57 kB
Initial upload of DeepSeek-R1-Distill-Qwen-1.5B-FP8-Dynamic
16 days ago
README.md
3.1 kB
Update README.md
15 days ago
config.json
1.84 kB
Initial upload of DeepSeek-R1-Distill-Qwen-1.5B-FP8-Dynamic
16 days ago
generation_config.json
181 Bytes
Initial upload of DeepSeek-R1-Distill-Qwen-1.5B-FP8-Dynamic
16 days ago
model.safetensors
2.25 GB
LFS
Initial upload of DeepSeek-R1-Distill-Qwen-1.5B-FP8-Dynamic
16 days ago
recipe.yaml
136 Bytes
Initial upload of DeepSeek-R1-Distill-Qwen-1.5B-FP8-Dynamic
16 days ago
special_tokens_map.json
485 Bytes
Initial upload of DeepSeek-R1-Distill-Qwen-1.5B-FP8-Dynamic
16 days ago
tokenizer.json
11.4 MB
LFS
Initial upload of DeepSeek-R1-Distill-Qwen-1.5B-FP8-Dynamic
16 days ago
tokenizer_config.json
6.75 kB
Initial upload of DeepSeek-R1-Distill-Qwen-1.5B-FP8-Dynamic
16 days ago