Nethermind
/

Mpt-Instruct-DotNet-XS

Text Generation

Model card Files Files and versions Community

Mpt-Instruct-DotNet-XS / ggml-model-f16.bin

Commit History

GGML models that can run f16 41.68 ms per token and q8 23.76 ms per token giving good results

56d7c99

Kabumbus commited on Sep 11, 2023