Update README.md
Browse files
README.md
CHANGED
@@ -18,6 +18,8 @@ We introduce the Bloomz-560m-guardrail model, which is a fine-tuning of the [Blo
|
|
18 |
* Insult: Offensive, disrespectful, or hurtful content used to attack or denigrate a person.
|
19 |
* Threat: Content that presents a direct threat to an individual.
|
20 |
|
|
|
|
|
21 |
Training
|
22 |
--------
|
23 |
|
|
|
18 |
* Insult: Offensive, disrespectful, or hurtful content used to attack or denigrate a person.
|
19 |
* Threat: Content that presents a direct threat to an individual.
|
20 |
|
21 |
+
This kind of modeling can be ideal for monitoring and controlling the output of generative models, as well as measuring the generated degree of toxicity.
|
22 |
+
|
23 |
Training
|
24 |
--------
|
25 |
|