Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
pankajroark
/
llama-fp16-engine
like
0
Model card
Files
Files and versions
xet
Community
2319002
llama-fp16-engine
Ctrl+K
Ctrl+K
1 contributor
History:
8 commits
pankajroark
inflight batching engine for 7b-sq-int8kv-tp1
2319002
almost 2 years ago
7b-no-quant-tp1
update non-quant engine again, this time with one that is locally tested
almost 2 years ago
7b-sq-int8kv-tp1
inflight batching engine for 7b-sq-int8kv-tp1
almost 2 years ago
7b-sq-int8kv-tp8
tp8 checkpoint
almost 2 years ago
.gitattributes
Safe
1.56 kB
checkpoint
almost 2 years ago
.gitignore
Safe
5 Bytes
checkpoint
almost 2 years ago