How to use the ONNX format of BERT-based model bge-m3, how to load the model and output sentence embeddings.

#52

by panjiayi - opened May 15, 2024

May 15, 2024

How to use the ONNX format of BERT-based model bge-m3, how to load the model and output sentence embeddings. Please tell me. Thank you!

Shitao

Beijing Academy of Artificial Intelligence org May 15, 2024

You can refer to this discussion: https://huggingface.co/BAAI/bge-m3/discussions/50

chaochaoli

Jul 10, 2024

onnx use cls pooling？
is this right？
···
model_ort = ORTModelForFeatureExtraction.from_pretrained(os.path.join(model_path, "onnx"), export=False)

def encode(text):
encoded_input = tokenizer(text, padding=True, truncation=True, return_tensors='pt')
model_output_ort = model_ort(**encoded_input)
return model_output_ort['last_hidden_state'][0][0, :]
···

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment