I have discovered an open-source implementation for KV Shifting Attention. https://github.com/erogol/BlaGPT
If you want to get started quickly, you can use 8 A100 and verify it in 2 hours.
- Downloads last month
- 12
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API:
The model has no library tag.