csris commited on
Commit
de1c644
1 Parent(s): d53a63d

update data sources to point at the correct URLs

Browse files
Files changed (1) hide show
  1. README.md +5 -3
README.md CHANGED
@@ -7,7 +7,9 @@ datasets:
7
  ---
8
  # GPT-JT-Moderation-6B v1
9
 
10
- This model card introduces on a moderation model, a [GPT-JT](https://huggingface.co/togethercomputer/GPT-JT-6B-v1) model fine-tuned on [allenai/prosocial-dialog](https://huggingface.co/datasets/allenai/prosocial-dialog).
 
 
11
 
12
  In chat applications the moderation model runs in tandem with the main chat bot, checking both the user question and the bot answer for any inappropriate content. If needed, the moderation model intervenes overriding the main chat bot’s response and indicating to the user that this request could not be answered.
13
 
@@ -52,8 +54,8 @@ Output:
52
  **Training Data**
53
 
54
  - [allenai/prosocial-dialog](https://huggingface.co/datasets/allenai/prosocial-dialog).
55
- - A small subset of [OpenChat](https://huggingface.co/togethercomputer/OpenChaT)'s data to augment `casual` queries.
56
- - The processed data can be found [here](https://drive.google.com/file/d/1ui4SuOYXyoq-5gVEC1NXwzJxs3hwaw0Y/view?usp=drivesdk).
57
 
58
  **Training Procedure**
59
 
 
7
  ---
8
  # GPT-JT-Moderation-6B v1
9
 
10
+ This model card introduces a moderation model, a [GPT-JT](https://huggingface.co/togethercomputer/GPT-JT-6B-v1) model fine-tuned on Ontocord.ai's [OIG-moderation dataset v0.1] (https://huggingface.co/datasets/ontocord/OIG-moderation).
11
+
12
+ This model can be used to moderate other chatbot models, including [GPT-NeoXT-Chat-Base-20B](https://huggingface.co/togethercomputer/GPT-NeoXT-Chat-Base-20B).
13
 
14
  In chat applications the moderation model runs in tandem with the main chat bot, checking both the user question and the bot answer for any inappropriate content. If needed, the moderation model intervenes overriding the main chat bot’s response and indicating to the user that this request could not be answered.
15
 
 
54
  **Training Data**
55
 
56
  - [allenai/prosocial-dialog](https://huggingface.co/datasets/allenai/prosocial-dialog).
57
+ - A small subset of LAION's [OIG dataset](https://huggingface.co/datasets/laion/OIG) to augment casual queries.
58
+ - The processed data can be found in the OIG-moderation repository [here](https://huggingface.co/datasets/ontocord/OIG-moderation/resolve/main/OIG_safety_v0.1.jsonl).
59
 
60
  **Training Procedure**
61