Qwen/Qwen2-VL-72B-Instruct-GPTQ-Int4
Tags: Image-Text-to-Text · Safetensors · English · qwen2_vl · multimodal · conversational · 4-bit precision · gptq
Papers: arxiv:2409.12191, arxiv:2308.12966
License: tongyi-qianwen
Files and versions
3 contributors · History: 6 commits
Latest commit 288c348 (11 months ago) by 可亲: "fix(pad zero) pad intermediate_size to 29696 to make sure quantized model can use 8 tensor-parallel in vllm"
| File | Size | Last commit | Age |
|---|---|---|---|
| .gitattributes | 1.52 kB | initial commit | 11 months ago |
| LICENSE | 6.96 kB | Create LICENSE | 11 months ago |
| README.md | 18.9 kB | Update README.md | 11 months ago |
| added_tokens.json | 392 Bytes | Upload folder using huggingface_hub | 11 months ago |
| chat_template.json | 1.05 kB | Upload folder using huggingface_hub | 11 months ago |
| config.json | 1.39 kB | fix(pad zero) pad intermediate_size to 29696 to make sure quantized model can use 8 tensor-parallel in vllm | 11 months ago |
| generation_config.json | 247 Bytes | Upload folder using huggingface_hub | 11 months ago |
| merges.txt | 1.67 MB | Upload folder using huggingface_hub | 11 months ago |
| model-00001-of-00011.safetensors | 3.97 GB | Upload folder using huggingface_hub | 11 months ago |
| model-00002-of-00011.safetensors | 3.92 GB | fix(pad zero) pad intermediate_size to 29696 to make sure quantized model can use 8 tensor-parallel in vllm | 11 months ago |
| model-00003-of-00011.safetensors | 4 GB | fix(pad zero) pad intermediate_size to 29696 to make sure quantized model can use 8 tensor-parallel in vllm | 11 months ago |
| model-00004-of-00011.safetensors | 4 GB | fix(pad zero) pad intermediate_size to 29696 to make sure quantized model can use 8 tensor-parallel in vllm | 11 months ago |
| model-00005-of-00011.safetensors | 3.92 GB | fix(pad zero) pad intermediate_size to 29696 to make sure quantized model can use 8 tensor-parallel in vllm | 11 months ago |
| model-00006-of-00011.safetensors | 4 GB | fix(pad zero) pad intermediate_size to 29696 to make sure quantized model can use 8 tensor-parallel in vllm | 11 months ago |
| model-00007-of-00011.safetensors | 4 GB | fix(pad zero) pad intermediate_size to 29696 to make sure quantized model can use 8 tensor-parallel in vllm | 11 months ago |
| model-00008-of-00011.safetensors | 3.92 GB | fix(pad zero) pad intermediate_size to 29696 to make sure quantized model can use 8 tensor-parallel in vllm | 11 months ago |
| model-00009-of-00011.safetensors | 4 GB | fix(pad zero) pad intermediate_size to 29696 to make sure quantized model can use 8 tensor-parallel in vllm | 11 months ago |
| model-00010-of-00011.safetensors | 4 GB | fix(pad zero) pad intermediate_size to 29696 to make sure quantized model can use 8 tensor-parallel in vllm | 11 months ago |
| model-00011-of-00011.safetensors | 3.33 GB | fix(pad zero) pad intermediate_size to 29696 to make sure quantized model can use 8 tensor-parallel in vllm | 11 months ago |
| model.safetensors.index.json | 244 kB | fix(pad zero) pad intermediate_size to 29696 to make sure quantized model can use 8 tensor-parallel in vllm | 11 months ago |
| preprocessor_config.json | 594 Bytes | Upload folder using huggingface_hub | 11 months ago |
| quantize_config.json | 207 Bytes | Upload folder using huggingface_hub | 11 months ago |
| special_tokens_map.json | 613 Bytes | Upload folder using huggingface_hub | 11 months ago |
| tokenizer.json | 7.03 MB | Upload folder using huggingface_hub | 11 months ago |
| tokenizer_config.json | 4.3 kB | Upload folder using huggingface_hub | 11 months ago |
| vocab.json | 2.78 MB | Upload folder using huggingface_hub | 11 months ago |
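The recurring commit message explains why several weight shards were re-uploaded: when vLLM runs 8-way tensor parallelism, each weight matrix is split across ranks, and with GPTQ quantization each rank's slice must also contain a whole number of quantization groups. A minimal sketch of the padding arithmetic (assumptions not stated on this page: a GPTQ group size of 128 and an original Qwen2-72B `intermediate_size` of 29568; only the 29696 target appears in the commit message):

```python
import math

def pad_intermediate_size(size: int, tp_size: int, group_size: int = 128) -> int:
    """Round `size` up to a multiple of tp_size * group_size, so each
    tensor-parallel shard holds a whole number of quantization groups."""
    multiple = tp_size * group_size
    return math.ceil(size / multiple) * multiple

# 29568 / 8 = 3696 columns per rank, which is not a multiple of 128,
# so the weights are zero-padded up to the next valid width:
print(pad_intermediate_size(29568, tp_size=8))  # 29696, matching the commit
```

With the padded size, each of the 8 ranks gets 29696 / 8 = 3712 columns, i.e. exactly 29 groups of 128, so the quantized checkpoint loads cleanly under `tensor_parallel_size=8`.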