tomaarsen HF staff commited on
Commit
1fcd62d
1 Parent(s): 5ddaccf

Add exported openvino model 'openvino_model_qint8_quantized.xml'

Browse files

Hello!

*This pull request has been automatically generated from the [`export_static_quantized_openvino_model`](https://sbert.net/docs/package_reference/util.html#sentence_transformers.backend.export_static_quantized_openvino_model) function from the Sentence Transformers library.*

## Config
```python
OVQuantizationConfig(
quant_method=<OVQuantizationMethod.DEFAULT: 'default'>
)
```

## Tip:
Consider testing this pull request before merging by loading the model from this PR with the `revision` argument:
```python
from sentence_transformers import SentenceTransformer

# TODO: Fill in the PR number
pr_number = 2
model = SentenceTransformer(
"mixedbread-ai/mxbai-embed-large-v1",
revision=f"refs/pr/{pr_number}",
backend="openvino",
model_kwargs={"file_name": "openvino_model_qint8_quantized.xml"},
)

# Verify that everything works as expected
embeddings = model.encode(["The weather is lovely today.", "It's so sunny outside!", "He drove to the stadium."])
print(embeddings.shape)

similarities = model.similarity(embeddings, embeddings)
print(similarities)
```

openvino/openvino_model_qint8_quantized.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:61df334e52dacadebc1cdadd3974be4ec70a339329830a1df67c7ba4f196c7ae
3
+ size 336759312
openvino/openvino_model_qint8_quantized.xml ADDED
The diff for this file is too large to render. See raw diff