truro7
/

vn-law-embedding

Sentence Similarity

sentence-transformers

feature-extraction

Generated from Trainer

loss:MatryoshkaLoss

loss:MultipleNegativesRankingLoss

text-embeddings-inference

Inference Endpoints

Model card Files Files and versions Community

truro7 commited on Sep 7, 2024

Commit

ec9a4c9

·

verified ·

1 Parent(s): cec2018

Update README.md

Files changed (1) hide show

README.md +81 -9

README.md CHANGED Viewed

@@ -5,15 +5,91 @@ datasets:
 - truro7/vn-law-questions-and-corpus
 language:
 - vi
-metrics:
-- accuracy
-- precision
-- recall
 base_model: hiieu/halong_embedding
-pipeline_tag: sentence-similarity
 library_name: sentence-transformers
 tags:
 - legal
 ---
@@ -25,7 +101,3 @@ The model is trained on a dataset of Vietnamese legal questions and correspondin
 It uses Matryoshka loss during training and can be truncated to smaller dimensions, allowing for faster comparisons between queries and documents without sacrificing performance.
----
-license: apache-2.0
----

 - truro7/vn-law-questions-and-corpus
 language:
 - vi
 base_model: hiieu/halong_embedding
 library_name: sentence-transformers
+metrics:
+- cosine_accuracy@1
+- cosine_accuracy@3
+- cosine_accuracy@5
+- cosine_accuracy@10
+- cosine_precision@1
+- cosine_precision@3
+- cosine_precision@5
+- cosine_precision@10
+- cosine_recall@1
+- cosine_recall@3
+- cosine_recall@5
+- cosine_recall@10
+- cosine_ndcg@10
+- cosine_mrr@10
+- cosine_map@100
+pipeline_tag: sentence-similarity
 tags:
 - legal
+- sentence-transformers
+- sentence-similarity
+- feature-extraction
+- generated_from_trainer
+- loss:MatryoshkaLoss
+- loss:MultipleNegativesRankingLoss
+model-index:
+- name: Halong Embedding
+  results:
+  - task:
+      type: information-retrieval
+      name: Information Retrieval
+    metrics:
+    - type: cosine_accuracy@1
+      value: 0.623
+      name: Cosine Accuracy@1
+    - type: cosine_accuracy@3
+      value: 0.792
+      name: Cosine Accuracy@3
+    - type: cosine_accuracy@5
+      value: 0.851
+      name: Cosine Accuracy@5
+    - type: cosine_accuracy@10
+      value: 0.900
+      name: Cosine Accuracy@10
+    - type: cosine_precision@1
+      value: 0.623
+      name: Cosine Precision@1
+    - type: cosine_precision@3
+      value: 0.412
+      name: Cosine Precision@3
+    - type: cosine_precision@5
+      value: 0.310
+      name: Cosine Precision@5
+    - type: cosine_precision@10
+      value: 0.184
+      name: Cosine Precision@10
+    - type: cosine_recall@1
+      value: 0.353
+      name: Cosine Recall@1
+    - type: cosine_recall@3
+      value: 0.608
+      name: Cosine Recall@3
+    - type: cosine_recall@5
+      value: 0.722
+      name: Cosine Recall@5
+    - type: cosine_recall@10
+      value: 0.823
+      name: Cosine Recall@10
+    - type: cosine_ndcg@10
+      value: 0.706
+      name: Cosine Ndcg@10
+    - type: cosine_mrr@10
+      value: 0.717
+      name: Cosine Mrr@10
+    - type: cosine_map@100
+      value: 0.645
+      name: Cosine Map@100
 ---
 It uses Matryoshka loss during training and can be truncated to smaller dimensions, allowing for faster comparisons between queries and documents without sacrificing performance.