CodeLlama-7b-Python GGUF
This is the original Meta CodeLlama-7b-Python model, converted to GGUF with python3 convert.py, which produced CodeLlama-7b-Python/ggml-model-f32.gguf.
The file was then split with gguf-split into smaller chunks of at most 32 tensors each (--split-max-tensors 32).
Commands used:
python3 convert.py ../codellama/CodeLlama-7b-Python
./gguf-split --split --split-max-tensors 32 ./models/CodeLlama-7b-Python/ggml-model-f32.gguf ./models/CodeLlama-7b-Python/ggml-model-f32
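To use the chunks as a single file again, they can be merged back with gguf-split. The sketch below builds the name of the first shard and prints a merge command instead of running it. It assumes the shards follow the `-%05d-of-%05d.gguf` suffix pattern used by llama.cpp's split naming. The shard count `TOTAL` and the output name ending in `-merged.gguf` are hypothetical placeholders, so check them against the files actually present.

```shell
#!/bin/sh
# Build the file name of shard number $2 out of $3 for output prefix $1.
# Assumption: gguf-split names shards "<prefix>-00001-of-0000N.gguf".
shard_name() {
    printf '%s-%05d-of-%05d.gguf' "$1" "$2" "$3"
}

# Prefix taken from the split command above.
PREFIX=./models/CodeLlama-7b-Python/ggml-model-f32
# Hypothetical shard count: replace with the number of chunks you actually have.
TOTAL=10

FIRST=$(shard_name "$PREFIX" 1 "$TOTAL")

# Print the merge command for review; --merge takes the first shard and an output path.
echo "./gguf-split --merge $FIRST $PREFIX-merged.gguf"
```

Printing the command first makes it easy to confirm the shard name exists on disk before merging. Recent llama.cpp builds may also be able to load a split model directly when pointed at the first shard, in which case merging is optional.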