Junrulu commited on
Commit
3b39e12
1 Parent(s): 6d44b91

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -1
README.md CHANGED
@@ -15,7 +15,11 @@ This repository provides a reproduction version of Tulu2-DPO-13B finetuned upon
15
 
16
  ## Performance
17
 
18
- Check more progressive training metrics and final benchmark results in our [code repository](https://github.com/LuJunru/LLM_Finetune/tree/DPO).
 
 
 
 
19
 
20
  ## Input Format
21
 
 
15
 
16
  ## Performance
17
 
18
+ | Model | Size | Alignment | MT-Bench (score) | AlpacaEval 2.0 (win rate %) |
19
+ |-------------|-----|----|---------------|--------------|
20
+ | **Tulu-v2-13b** 🐪 | **13B** | **SFT** | **5.79** | **2.61** |
21
+ | **Tulu-v2-dpo-13b** 🐪 | **13B** | **DPO** | **6.06** | **6.96** |
22
+ | **Reproduced-tulu2-dpo-13b** | **13B** | **DPO** | **6.27** | **6.71** |
23
 
24
  ## Input Format
25