KV_shifting_2.9B / README.md
metadata
license: apache-2.0

I found an open-source implementation of KV Shifting Attention: https://github.com/erogol/BlaGPT
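
For readers who want a rough picture before opening that repository, here is a minimal sketch of the core idea as I understand it: each key (and, separately, each value) is mixed with the key (or value) of the previous token before ordinary attention is applied. The function name `shift_mix`, the fixed mixing weights, and the zero padding at the first position are assumptions made for illustration. In practice the weights would be learned, and this is not taken from the BlaGPT code or from this model's code.

```python
def shift_mix(seq, a_cur, a_prev):
    """Mix each vector with its predecessor.

    out[i] = a_cur * seq[i] + a_prev * seq[i - 1]
    The first position has no predecessor, so a zero vector is used there
    (an assumption of this sketch). Expects a non-empty list of vectors.
    """
    dim = len(seq[0])
    prev = [[0.0] * dim] + seq[:-1]  # sequence shifted right by one step
    return [
        [a_cur * c + a_prev * p for c, p in zip(cur_vec, prev_vec)]
        for cur_vec, prev_vec in zip(seq, prev)
    ]


# Toy keys for a 3-token sequence with head dimension 2.
keys = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0]]

# Hypothetical fixed weights; a real model would learn these.
mixed = shift_mix(keys, 0.75, 0.25)
print(mixed)
```

The same helper would be applied to the values with their own pair of weights, and the mixed keys and values would then go into standard causal attention.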

If you want to get started quickly, you can verify it in 2 hours on 8 A100 GPUs.