govtech/jina-embeddings-v2-small-en-off-topic

ONNX

English

gabrielchua committed on Nov 22

Commit 806cf24 • 1 Parent(s): ba6803f

Update README to add further results

Files changed (1)

README.md +21 -1

@@ -2,6 +2,17 @@
 license: other
 license_name: govtech-singapore
 license_link: LICENSE
+datasets:
+- gabrielchua/off-topic
+language:
+- en
+metrics:
+- roc_auc
+- f1
+- precision
+- recall
+base_model:
+- jinaai/jina-embeddings-v2-small-en
 ---
 # Off-Topic Classification Model
@@ -16,9 +27,18 @@ This repository contains a fine-tuned **Jina Embeddings model** designed to perf
 ## Performance
+We evaluated our fine-tuned models on synthetic data modelling system and user prompt pairs reflecting real world enterprise use cases of LLMs. The dataset is available [here](https://huggingface.co/datasets/gabrielchua/off-topic).
 | Approach                              | Model                          | ROC-AUC | F1   | Precision | Recall |
 |---------------------------------------|--------------------------------|---------|------|-----------|--------|
-| Fine-tuned bi-encoder classifier      | jina-embeddings-v2-small-en    | 0.99    | 0.97 | 0.99      | 0.95   |
+| [Fine-tuned bi-encoder classifier](https://huggingface.co/govtech/jina-embeddings-v2-small-en-off-topic)      | jina-embeddings-v2-small-en    | 0.99    | 0.97 | 0.99      | 0.95   |
+| 👉 [Fine-tuned cross-encoder classifier](https://huggingface.co/govtech/stsb-roberta-base-off-topic)   | stsb-roberta-base              | 0.99    | 0.99 | 0.99      | 0.99   |
+| Pre-trained cross-encoder             | stsb-roberta-base              | 0.73    | 0.68 | 0.53      | 0.93   |
+| Prompt Engineering             | GPT 4o (2024-08-06)              | -    | 0.95 | 0.94      | 0.97   |
+| Prompt Engineering             | GPT 4o Mini (2024-07-18)              | -    | 0.91 | 0.85      | 0.91   |
+| Zero-shot Classification             | GPT 4o Mini (2024-07-18)              | 0.99    | 0.97 | 0.95      | 0.99   |
+Further evaluation results on additional synthetic and external datasets (e.g.,`JailbreakBench`, `HarmBench`, `TrustLLM`) are available in our [technical report](https://arxiv.org/abs/2411.12946).
 ## Usage
 1. Clone this repository and install the required dependencies: