ymcki commited on
Commit
40d9c5f
1 Parent(s): c50b85f

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -0
README.md CHANGED
@@ -43,6 +43,17 @@ is compared to the original model.
43
 
44
  ORPO fine tuning is currently underway to see if it can regain its sanity. You can play with this model first or wait until I am done with the fine tuning.
45
 
 
 
 
 
 
 
 
 
 
 
 
46
  ## How to run this model
47
 
48
  ```py
 
43
 
44
  ORPO fine tuning is currently underway to see if it can regain its sanity. You can play with this model first or wait until I am done with the fine tuning.
45
 
46
+ ## Benchmark (100.0*raw scores only)
47
+
48
+ Click on the average number to go to the raw score json generated by Open LLM Leaderboard.
49
+
50
+ | Model | Average | IFEval | BHH | Math Lv5 | MUSR | MMLU-PRO |
51
+ | ----- | ------- | ------ | ----|--------- | ---- | -------- |
52
+ | [gemma-2-2b-jpn-it](https://huggingface.co/google/gemma-2-2b-jpn-it) | [30.82](https://huggingface.co/datasets/open-llm-leaderboard/results/blob/main/google/gemma-2-2b-jpn-it/results_2024-10-15T15-21-39.173019.json) | 54.11 | 41.43 | 0.0 | 27.52 | 37.17 | 24.67 |
53
+ | [gemma-2-2b-jpn-it-abliterated-18](https://huggingface.co/google/gemma-2-2b-jpn-it-abliterated-18) | [16.74](https://huggingface.co/datasets/open-llm-leaderboard/results/raw/main/ymcki/gemma-2-2b-jpn-it-abliterated-18/results_2024-10-16T07-58-03.781979.json) | 0.0 | 29.13 | 0.0 | 25.92 | 33.73 | 11.68 |
54
+
55
+ Indeed, it is quite dumbed down relative to the original.
56
+
57
  ## How to run this model
58
 
59
  ```py