File size: 218 Bytes
8b8b292 |
1 2 3 4 5 6 |
---
license: apache-2.0
---
I have discovered an open-source implementation for KV Shifting Attention. https://github.com/erogol/BlaGPT
If you want to get started quickly, you can use 8 A100 and verify it in 2 hours. |