Commit
·
83bd990
1
Parent(s):
5c47791
Update README.md
Browse files
README.md
CHANGED
@@ -12,6 +12,6 @@ Experimental results on several safety benchmark datasets indicate that **Starli
|
|
12 |
|
13 |
<img src="https://declare-lab.net/assets/images/logos/starling-results.png" alt="Image" width="1000" height="335">
|
14 |
|
15 |
-
TruthfulQA (MC2): 48.90
|
16 |
|
17 |
-
Note: This model is referred as Starling (Blue) in the paper. We shall soon release Starling (Blue-Red) which was trained on harmful data using an objective function that helps model learn from the negative data.
|
|
|
12 |
|
13 |
<img src="https://declare-lab.net/assets/images/logos/starling-results.png" alt="Image" width="1000" height="335">
|
14 |
|
15 |
+
TruthfulQA (MC2): 48.90 vs Vicuna's 47.00
|
16 |
|
17 |
+
Note: This model is referred to as Starling (Blue) in the paper. We shall soon release Starling (Blue-Red) which was trained on harmful data using an objective function that helps model learn from the negative data.
|