Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
allenai
/
OLMoE-1B-7B-0924
like
91
Text Generation
Transformers
Safetensors
allenai/OLMoE-mix-0924
English
olmoe
Mixture of Experts
olmo
Carbon Emissions
Inference Endpoints
arxiv:
2409.02060
License:
apache-2.0
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
83792c6
OLMoE-1B-7B-0924
4 contributors
History:
20 commits
awettig
Fix load balancing coefficient
83792c6
verified
21 days ago
.gitattributes
2.14 kB
fp32 -> bf16; step1200000
about 2 months ago
README.md
2.46 kB
Update README.md
22 days ago
config.json
759 Bytes
Fix load balancing coefficient
21 days ago
generation_config.json
120 Bytes
Update generation_config.json
about 2 months ago
model-00001-of-00003.safetensors
5 GB
LFS
Move anneal branch to main
about 1 month ago
model-00002-of-00003.safetensors
5 GB
LFS
Move anneal branch to main
about 1 month ago
model-00003-of-00003.safetensors
3.84 GB
LFS
Move anneal branch to main
about 1 month ago
model.safetensors.index.json
287 kB
fp32 -> bf16; step1200000
about 2 months ago
olmoe-logo.png
23.8 kB
Upload olmoe-logo.png
22 days ago
special_tokens_map.json
65 Bytes
Add step1200000
about 2 months ago
tokenizer.json
2.12 MB
Add step1200000
about 2 months ago
tokenizer_config.json
5.37 kB
Add step1200000
about 2 months ago