99860dd
1
2
3
4
5
--- license: mit --- This one with a custom `config.head_dim` as allowed by the architecture (see 7b model).