npip99 commited on
Commit
d414210
·
verified ·
1 Parent(s): 1972d8f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -21
README.md CHANGED
@@ -1,5 +1,5 @@
1
  ---
2
- license: cc-by-nc-4.0
3
  language:
4
  - en
5
  base_model:
@@ -12,16 +12,16 @@ tags:
12
  - stem
13
  - medical
14
  ---
15
- # zerank-1-small: Smaller, faster version of zerank-1
16
 
17
- <!-- Provide a quick summary of what the model is/does. -->
18
 
 
19
 
20
- This model is the smaller version of [zeroentropy/zerank-1](https://huggingface.co/zeroentropy/zerank-1). Though the model is over 2x smaller, it maintains nearly the same standard of performance, continuing to outperform other popular rerankers.
21
-
22
- It is an open-weights reranker model meant to be integrated into RAG applications to rerank results from preliminary search methods such as embeddings, BM25, and hybrid search.
23
 
 
24
 
 
25
 
26
  ## How to Use
27
 
@@ -40,9 +40,11 @@ scores = model.predict(query_documents)
40
  print(scores)
41
  ```
42
 
 
 
43
  ## Evaluations
44
 
45
- Comparing NDCG@10 starting from top 100 documents by embedding (using text-3-embedding-small):
46
 
47
  | Task | Embedding | cohere-rerank-v3.5 | Salesforce/Llama-rank-v1 | **zerank-1-small** | zerank-1 |
48
  |----------------|-----------|--------------------|--------------------------|----------------|----------|
@@ -53,18 +55,6 @@ Comparing NDCG@10 starting from top 100 documents by embedding (using text-3-emb
53
  | Medical | 0.619 | 0.750 | 0.719 | **0.773** | 0.796 |
54
  | STEM | 0.401 | 0.510 | 0.595 | **0.680** | 0.694 |
55
 
56
- Comparing BM25 and Hybrid Search without and with zerank-1:
57
-
58
- <img src="https://cdn-uploads.huggingface.co/production/uploads/67776f9dcd9c9435499eafc8/2GPVHFrI39FspnSNklhsM.png" alt="Description" width="400"/> <img src="https://cdn-uploads.huggingface.co/production/uploads/67776f9dcd9c9435499eafc8/dwYo2D7hoL8QiE8u3yqr9.png" alt="Description" width="400"/>
59
-
60
-
61
- ## Citation
62
-
63
- **BibTeX:**
64
-
65
- Coming soon!
66
-
67
- **APA:**
68
-
69
- Coming soon!
70
 
 
 
1
  ---
2
+ license: apache-2.0
3
  language:
4
  - en
5
  base_model:
 
12
  - stem
13
  - medical
14
  ---
 
15
 
16
+ <img src="https://i.imgur.com/oxvhvQu.png"/>
17
 
18
+ # Releasing zeroentropy/zerank-1-small
19
 
20
+ In search enginers, [rerankers are crucial](https://www.zeroentropy.dev/blog/what-is-a-reranker-and-do-i-need-one) for improving the accuracy of your retrieval system.
 
 
21
 
22
+ This 1.7B reranker is the smaller version of our flagship model [zeroentropy/zerank-1](https://huggingface.co/zeroentropy/zerank-1). Though the model is over 2x smaller, it maintains nearly the same standard of performance, continuing to outperform other popular rerankers, and displaying massive accuracy gains over traditional vector search.
23
 
24
+ We release this model under the open-source Apache 2.0 license, in order to support the open-source community and push the frontier of what's possible with open-source models.
25
 
26
  ## How to Use
27
 
 
40
  print(scores)
41
  ```
42
 
43
+ The model can also be inferenced using ZeroEntropy's [/models/rerank](https://docs.zeroentropy.dev/api-reference/models/rerank) endpoint.
44
+
45
  ## Evaluations
46
 
47
+ NDCG@10 scores between `zerank-1-small` and competing closed-source proprietary rerankers. Since we are evaluating rerankers, OpenAI's `text-embedding-3-small` is used as an initial retriever for the Top 100 candidate documents.
48
 
49
  | Task | Embedding | cohere-rerank-v3.5 | Salesforce/Llama-rank-v1 | **zerank-1-small** | zerank-1 |
50
  |----------------|-----------|--------------------|--------------------------|----------------|----------|
 
55
  | Medical | 0.619 | 0.750 | 0.719 | **0.773** | 0.796 |
56
  | STEM | 0.401 | 0.510 | 0.595 | **0.680** | 0.694 |
57
 
58
+ Comparing BM25 and Hybrid Search without and with `zerank-1-small`:
 
 
 
 
 
 
 
 
 
 
 
 
 
59
 
60
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/67776f9dcd9c9435499eafc8/2GPVHFrI39FspnSNklhsM.png" alt="Description" width="400"/> <img src="https://cdn-uploads.huggingface.co/production/uploads/67776f9dcd9c9435499eafc8/dwYo2D7hoL8QiE8u3yqr9.png" alt="Description" width="400"/>