mpt-7b-storywriter-fast / flash_attn_triton.py

Commit History

add fast loading/inference
2e25a9a

emozilla commited on

add flash_attn_triton.py (#20)
84cfa23

daking vchiley commited on