Update README.md
Browse files
README.md
CHANGED
@@ -15,7 +15,11 @@ This repository provides a reproduction version of Tulu2-DPO-13B finetuned upon
|
|
15 |
|
16 |
## Performance
|
17 |
|
18 |
-
|
|
|
|
|
|
|
|
|
19 |
|
20 |
## Input Format
|
21 |
|
|
|
15 |
|
16 |
## Performance
|
17 |
|
18 |
+
| Model | Size | Alignment | MT-Bench (score) | AlpacaEval 2.0 (win rate %) |
|
19 |
+
|-------------|-----|----|---------------|--------------|
|
20 |
+
| **Tulu-v2-13b** 🐪 | **13B** | **SFT** | **5.79** | **2.61** |
|
21 |
+
| **Tulu-v2-dpo-13b** 🐪 | **13B** | **DPO** | **6.06** | **6.96** |
|
22 |
+
| **Reproduced-tulu2-dpo-13b** | **13B** | **DPO** | **6.27** | **6.71** |
|
23 |
|
24 |
## Input Format
|
25 |
|