Update README.md
Browse files
README.md
CHANGED
@@ -3,14 +3,20 @@ license: mit
|
|
3 |
language:
|
4 |
- en
|
5 |
base_model: Technoculture/MT7Bi-sft
|
|
|
|
|
6 |
---
|
7 |
|
8 |
# MT7Bi-dpo
|
9 |
|
|
|
|
|
10 |
[Technoculture/MT7Bi-sft (base)](https://huggingface.co/Technoculture/MT7Bi-sft) + [Technoculture/MT7Bi-alpha-dpo-v0.2 (adapter)](https://huggingface.co/Technoculture/MT7Bi-alpha-dpo-v0.2)
|
11 |
|
12 |
# Open LLM Leaderboard
|
13 |
|
|
|
|
|
14 |
| Model Name | ARC | HellaSwag | MMLU | TruthfulQA | Winogrande | GSM8K |
|
15 |
| ------------------ | -------- | --------- | ---- | ---------- | ---------- | -------- |
|
16 |
| Orca-2-7b | **78.4** | 76.1 | 53.7 | **52.4** | **74.2** | **47.2** |
|
|
|
3 |
language:
|
4 |
- en
|
5 |
base_model: Technoculture/MT7Bi-sft
|
6 |
+
datasets:
|
7 |
+
- Technoculture/MT7Bi-alpha-dpo-v0.2
|
8 |
---
|
9 |
|
10 |
# MT7Bi-dpo
|
11 |
|
12 |
+

|
13 |
+
|
14 |
[Technoculture/MT7Bi-sft (base)](https://huggingface.co/Technoculture/MT7Bi-sft) + [Technoculture/MT7Bi-alpha-dpo-v0.2 (adapter)](https://huggingface.co/Technoculture/MT7Bi-alpha-dpo-v0.2)
|
15 |
|
16 |
# Open LLM Leaderboard
|
17 |
|
18 |
+

|
19 |
+
|
20 |
| Model Name | ARC | HellaSwag | MMLU | TruthfulQA | Winogrande | GSM8K |
|
21 |
| ------------------ | -------- | --------- | ---- | ---------- | ---------- | -------- |
|
22 |
| Orca-2-7b | **78.4** | 76.1 | 53.7 | **52.4** | **74.2** | **47.2** |
|