README.md · neuralmagic/bge-base-en-v1.5-quant at 487aa68454cd3100ef4501b04ec06fce672ff11d

license: mit

This is the quantized (INT8) ONNX variant of the bge-base-en-v1.5 embeddings model. It was created with DeepSparse Optimum for the ONNX export and inference pipeline, and with Neural Magic's Sparsify for one-shot quantization.
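
For readers unfamiliar with the term, the sketch below illustrates what INT8 quantization does to a set of weights. It is a generic symmetric, per-tensor scheme written only for illustration. It does not reproduce Sparsify's actual one-shot procedure, and the names `quantize_int8` and `dequantize_int8` are hypothetical helpers, not part of any library mentioned here.

```python
from typing import List, Tuple


def quantize_int8(values: List[float]) -> Tuple[List[int], float]:
    """Symmetric per-tensor quantization (illustrative assumption):
    map floats to integers in [-127, 127] using a single scale."""
    max_abs = max((abs(v) for v in values), default=0.0)
    # One scale for the whole tensor; fall back to 1.0 for all-zero input.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    quantized = [max(-127, min(127, round(v / scale))) for v in values]
    return quantized, scale


def dequantize_int8(quantized: List[int], scale: float) -> List[float]:
    """Recover approximate float values from the INT8 codes."""
    return [q * scale for q in quantized]


# Hypothetical example weights, not taken from the model.
weights = [0.50, -1.27, 0.03, 1.00]
q, scale = quantize_int8(weights)
restored = dequantize_int8(q, scale)

# The round-trip error of each value is bounded by half the scale.
max_error = max(abs(a - b) for a, b in zip(weights, restored))
print(q, scale, max_error)
```

Each weight is stored as one signed byte plus a shared scale, which is where the size and speed benefits of an INT8 model come from. The cost is a small rounding error per value.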

Current list of sparse and quantized bge ONNX models:

zeroshot/bge-large-en-v1.5-sparse

zeroshot/bge-large-en-v1.5-quant

zeroshot/bge-base-en-v1.5-sparse

zeroshot/bge-base-en-v1.5-quant

zeroshot/bge-small-en-v1.5-sparse

zeroshot/bge-small-en-v1.5-quant