Upload README.md
README.md
CHANGED
@@ -43,6 +43,17 @@ is compared to the original model.
ORPO fine-tuning is currently underway to see if the model can regain its sanity. You can play with this model now or wait until I am done with the fine-tuning.

## Benchmark (100.0*raw scores only)

Click on the average score to go to the raw score JSON generated by the Open LLM Leaderboard.

| Model | Average | IFEval | BBH | Math Lv5 | GPQA | MUSR | MMLU-PRO |
| ----- | ------- | ------ | --- | -------- | ---- | ---- | -------- |
| [gemma-2-2b-jpn-it](https://huggingface.co/google/gemma-2-2b-jpn-it) | [30.82](https://huggingface.co/datasets/open-llm-leaderboard/results/blob/main/google/gemma-2-2b-jpn-it/results_2024-10-15T15-21-39.173019.json) | 54.11 | 41.43 | 0.0 | 27.52 | 37.17 | 24.67 |
| [gemma-2-2b-jpn-it-abliterated-18](https://huggingface.co/google/gemma-2-2b-jpn-it-abliterated-18) | [16.74](https://huggingface.co/datasets/open-llm-leaderboard/results/raw/main/ymcki/gemma-2-2b-jpn-it-abliterated-18/results_2024-10-16T07-58-03.781979.json) | 0.0 | 29.13 | 0.0 | 25.92 | 33.73 | 11.68 |

Indeed, it is quite dumbed down relative to the original: the average drops from 30.82 to 16.74, with IFEval falling from 54.11 to 0.0.

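The Average column appears to be the unweighted mean of the six per-benchmark scores in each row. That is an inference from the numbers, not something the README states, and the benchmark labels below follow the Open LLM Leaderboard task list as an assumption. A minimal sketch that rechecks the averages and shows the per-benchmark change, with all variable names hypothetical:

```py
from statistics import mean

# Benchmark labels are assumed to follow the Open LLM Leaderboard task order.
BENCHMARKS = ["IFEval", "BBH", "Math Lv5", "GPQA", "MUSR", "MMLU-PRO"]

# Scores (100.0 * raw score) copied from the two table rows.
scores = {
    "gemma-2-2b-jpn-it": [54.11, 41.43, 0.0, 27.52, 37.17, 24.67],
    "gemma-2-2b-jpn-it-abliterated-18": [0.0, 29.13, 0.0, 25.92, 33.73, 11.68],
}

# Unweighted mean per model, rounded to two decimals like the table.
averages = {name: round(mean(vals), 2) for name, vals in scores.items()}

# Per-benchmark change of the abliterated model relative to the original.
original = scores["gemma-2-2b-jpn-it"]
abliterated = scores["gemma-2-2b-jpn-it-abliterated-18"]
deltas = {
    bench: round(after - before, 2)
    for bench, before, after in zip(BENCHMARKS, original, abliterated)
}

for name, avg in averages.items():
    print(f"{name}: average {avg}")
for bench, delta in deltas.items():
    print(f"{bench}: {delta:+.2f}")
```

For these two rows the rounded means come out to 30.82 and 16.74, matching the Average column, and the deltas make it easy to see that most of the loss sits in IFEval, BBH, and MMLU-PRO.
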
## How to run this model

```py