hooman650 committed
Commit 68e3232
1 Parent(s): d8e90a9

Update README.md

add important download info

Files changed (1)
  1. README.md +19 -2
README.md CHANGED
@@ -11,7 +11,23 @@ This is `bge-m3-onnx-o4` weights of the original [`BAAI/bge-m3`](https://hugging
  - [x] Multi-Linguality: It can support more than **100** working languages.
  - [x] Multi-Granularity: It is able to process inputs of different granularities, spanning from short sentences to long documents of up to **8192** tokens.

- ## Usage
+ ## Usage
+
+ ### IMPORTANT - DOWNLOAD MODEL WEIGHTS
+
+ Please see the instructions below.
+
+ 1. **Download** the checkpoint: For some reason you cannot directly load from this online version (you will get an exception).
+ Please download this repo as below:
+
+ ```
+ # pip install huggingface-hub
+
+ from huggingface_hub import snapshot_download
+
+ snapshot_download(repo_id="hooman650/bge-m3-onnx-o4",local_dir="bge-m3-onnx")
+ ```
+
+
  ### Dense Retrieval

@@ -26,7 +42,8 @@ from optimum.onnxruntime import ORTModelForFeatureExtraction
  from transformers import AutoTokenizer
  import torch

- model = ORTModelForFeatureExtraction.from_pretrained("hooman650/bge-m3-onnx-o4", provider="CUDAExecutionProvider")
+ # Make sure that you download the model weights locally to `bge-m3-onnx`
+ model = ORTModelForFeatureExtraction.from_pretrained("bge-m3-onnx", provider="CUDAExecutionProvider") # omit provider for CPU usage.
  tokenizer = AutoTokenizer.from_pretrained("hooman650/bge-m3-onnx-o4")

  sentences = [
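
For reference, here is a minimal end-to-end sketch of the workflow this commit documents: download the checkpoint locally with `snapshot_download`, then load the ONNX weights from the local folder. The example sentences and the CLS-pooling/normalization step at the end are illustrative assumptions, not part of the commit itself.

```python
# pip install huggingface-hub optimum[onnxruntime-gpu] transformers torch
from huggingface_hub import snapshot_download
from optimum.onnxruntime import ORTModelForFeatureExtraction
from transformers import AutoTokenizer
import torch

# Step 1: download the checkpoint to a local folder first;
# loading directly from the Hub raises an exception for this repo.
local_dir = snapshot_download(repo_id="hooman650/bge-m3-onnx-o4", local_dir="bge-m3-onnx")

# Step 2: load the ONNX model from the local folder (omit `provider` to run on CPU).
model = ORTModelForFeatureExtraction.from_pretrained(local_dir, provider="CUDAExecutionProvider")
tokenizer = AutoTokenizer.from_pretrained("hooman650/bge-m3-onnx-o4")

sentences = ["What is BGE M3?", "Where is Vancouver?"]  # hypothetical example inputs

inputs = tokenizer(sentences, padding=True, truncation=True, return_tensors="pt").to(model.device)
with torch.no_grad():
    last_hidden = model(**inputs).last_hidden_state

# Assumed dense-retrieval pooling: take the CLS token embedding and L2-normalize it.
dense_embeddings = torch.nn.functional.normalize(last_hidden[:, 0], dim=-1)
print(dense_embeddings.shape)  # (batch_size, hidden_size)
```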