LightEmbed
/

sbert-all-MiniLM-L6-v2-onnx

Sentence Similarity

sentence-transformers

feature-extraction

Model card Files Files and versions Community

binhcode25 commited on Jun 17

Commit

a0f3972

•

1 Parent(s): d792ef7

Add new SentenceTransformer model.

Files changed (3) hide show

README.md +46 -0
model.onnx +2 -2
tokenizer.json +16 -2

README.md ADDED Viewed

	@@ -0,0 +1,46 @@

+---
+library_name: light-embed
+pipeline_tag: sentence-similarity
+tags:
+- sentence-transformers
+- feature-extraction
+- sentence-similarity
+---
+# sbert-all-MiniLM-L6-v2-onnx
+This is the ONNX version of the Sentence Transformers model sentence-transformers/all-MiniLM-L6-v2 for sentence embedding, optimized for speed and lightweight performance. By utilizing onnxruntime and tokenizers instead of heavier libraries like sentence-transformers and transformers, this version ensures a smaller library size and faster execution. Below are the details of the model:
+- Base model: sentence-transformers/all-MiniLM-L6-v2
+- Embedding dimension: 384
+- Max sequence length: 256
+- File size on disk:  0.08 GB
+- Pooling incorporated: Yes
+This ONNX model consists all components in the original sentence transformer model:
+Transformer, Pooling, Normalize
+<!--- Describe your model here -->
+## Usage (LightEmbed)
+Using this model becomes easy when you have [LightEmbed](https://pypi.org/project/light-embed/) installed:
+```
+pip install -U light-embed
+```
+Then you can use the model like this:
+```python
+from light_embed import TextEmbedding
+sentences = ["This is an example sentence", "Each sentence is converted"]
+model = TextEmbedding('sentence-transformers/all-MiniLM-L6-v2')
+embeddings = model.encode(sentences)
+print(embeddings)
+```
+## Citing & Authors
+Binh Nguyen / [email protected]

model.onnx CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1fef24b391a698bc5a4941d0349925014ee29cc00c21486e09e238b46936b37f
-size 90446038

 version https://git-lfs.github.com/spec/v1
+oid sha256:bf79aa51e1c7a52c48441b1d2234d6b58d1a9e53a75cc8fc91033606cbb6802f
+size 90446096

tokenizer.json CHANGED Viewed

@@ -1,7 +1,21 @@
 {
   "version": "1.0",
-  "truncation": null,
-  "padding": null,
   "added_tokens": [
     {
       "id": 0,

 {
   "version": "1.0",
+  "truncation": {
+    "direction": "Right",
+    "max_length": 128,
+    "strategy": "LongestFirst",
+    "stride": 0
+  },
+  "padding": {
+    "strategy": {
+      "Fixed": 128
+    },
+    "direction": "Right",
+    "pad_to_multiple_of": null,
+    "pad_id": 0,
+    "pad_type_id": 0,
+    "pad_token": "[PAD]"
+  },
   "added_tokens": [
     {
       "id": 0,