my-awesome-model / README.md
simarora's picture
Create README.md
5fb3686 verified
---
datasets:
- EleutherAI/pile
language:
- en
---
Based model but uses layernorm instead of QK.sum(-1) for the normalization, for better hardware efficiency.