ymcki committed
Commit: ca09138
Parent: 7e4b1a4

Upload README.md

Files changed (1): README.md (+11, -6)
README.md CHANGED
@@ -2,6 +2,9 @@
 base_model: google/gemma-2-2b-jpn-it
 language:
 - multilingual
+datasets:
+- mlabonne/harmless_alpaca
+- mlabonne/harmful_behaviors
 library_name: transformers
 license: gemma
 license_link: https://ai.google.dev/gemma/terms
@@ -37,8 +40,10 @@ described by mlabonne.
 
 Layer 18 of the original model was chosen for abliteration.
 I also created another layer 17 abliterated model for comparison.
+These two layers were chosen because they both produce uncensored
+responses after the respective layer was abliterated.
 
-It is uploaded here to be evaluated by the LLM Leaderboard to see how brain damaged it
+It is uploaded here to be evaluated by the Open LLM Leaderboard to see how brain damaged it
 is compared to the original model.
 
 ORPO fine tuning is currently underway to see if it can regain its sanity. You can play with this model first or wait until I am done with the fine tuning.
@@ -47,13 +52,13 @@ ORPO fine tuning is currently underway to see if it can regain its sanity. You c
 
 Click on the model name to go to the raw score JSON generated by the Open LLM Leaderboard.
 
-| Model | Average | IFEval | BHH | Math Lv5 | MUSR | MMLU-PRO |
-| ----- | ------- | ------ | ----|--------- | ---- | -------- |
+| Model | Average | IFEval | BBH | Math Lv5 | GPQA | MUSR | MMLU-PRO |
+| ----- | ------- | ------ | --- | -------- | ---- | ---- | -------- |
 | [gemma-2-2b-jpn-it](https://huggingface.co/datasets/open-llm-leaderboard/results/blob/main/google/gemma-2-2b-jpn-it/results_2024-10-15T15-21-39.173019.json) | 30.82 | 54.11 | 41.43 | 0.0 | 27.52 | 37.17 | 24.67 |
-| [gemma-2-2b-jpn-it-abliterated-18](https://huggingface.co/datasets/open-llm-leaderboard/results/raw/main/ymcki/gemma-2-2b-jpn-it-abliterated-18/results_2024-10-16T07-58-03.781979.json) 16.74 | 0.0 | 29.13 | 0.0 | 25.92 | 33.73 | 11.68 |
-| gemma-2-2b-jpn-it-abliterated-17 | TBD | TBD | TBD | TBD | TBD | TBD |
+| [gemma-2-2b-jpn-it-abliterated-17](https://huggingface.co/datasets/open-llm-leaderboard/results/raw/main/ymcki/gemma-2-2b-jpn-it-abliterated-17/results_2024-10-18T15-18-46.821674.json) | 30.29 | 52.65 | 40.46 | 0.0 | 27.18 | 36.90 | 24.55 |
+| [gemma-2-2b-jpn-it-abliterated-18](https://huggingface.co/datasets/open-llm-leaderboard/results/raw/main/ymcki/gemma-2-2b-jpn-it-abliterated-18/results_2024-10-18T15-41-42.399571.json) | 30.61 | 53.02 | 40.96 | 0.0 | 27.35 | 37.30 | 25.05 |
 
-Indeed, it is quite dumbed down relative to the original.
+It is only slightly dumber than the original.
 
 ## How to run this model
 
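
The README (and the new `datasets` metadata above) point at mlabonne's abliteration recipe: estimate a "refusal direction" from activations on harmful vs. harmless prompts at one layer, then project that direction out of the weights that write into the residual stream. A minimal sketch of that idea, assuming the two datasets expose a `text` column and using plain `transformers` hidden states rather than the original notebook's tooling:

```python
# Sketch of single-layer abliteration (refusal-direction removal), following the
# approach the README attributes to mlabonne. Layer index and dataset names come
# from the README; the "text" column and sample count are assumptions.
import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "google/gemma-2-2b-jpn-it"
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float32)

LAYER = 18  # the layer chosen for abliteration in this repo (17 for the sibling model)

def mean_activation(prompts):
    """Mean residual-stream activation at LAYER, taken at each prompt's last token."""
    acts = []
    for p in prompts:
        ids = tok(p, return_tensors="pt")
        with torch.no_grad():
            out = model(**ids, output_hidden_states=True)
        acts.append(out.hidden_states[LAYER][0, -1])  # hidden_states[0] is the embedding output
    return torch.stack(acts).mean(0)

harmful = load_dataset("mlabonne/harmful_behaviors", split="train")["text"][:32]
harmless = load_dataset("mlabonne/harmless_alpaca", split="train")["text"][:32]

# Refusal direction = normalized difference of mean activations.
d = mean_activation(harmful) - mean_activation(harmless)
d = d / d.norm()

# Orthogonalize the matrices that write into the residual stream so the model
# can no longer express the refusal direction (embeddings omitted for brevity).
P = torch.outer(d, d)
for block in model.model.layers:
    block.self_attn.o_proj.weight.data -= P @ block.self_attn.o_proj.weight.data
    block.mlp.down_proj.weight.data -= P @ block.mlp.down_proj.weight.data

model.save_pretrained("gemma-2-2b-jpn-it-abliterated-18")
```

The layer-17 variant would differ only in `LAYER = 17`; per the README, both layers were kept because both yield uncensored responses after abliteration.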
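
The README also says ORPO fine-tuning is underway to recover quality lost to abliteration. A minimal sketch with TRL's `ORPOTrainer`, assuming TRL ~0.8-0.11 (where the trainer takes a `tokenizer` argument) and a preference dataset with `prompt`/`chosen`/`rejected` columns; the dataset and hyperparameters below are illustrative, not the author's actual recipe:

```python
# Illustrative ORPO fine-tuning of the abliterated model with TRL.
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import ORPOConfig, ORPOTrainer

model_id = "ymcki/gemma-2-2b-jpn-it-abliterated-18"
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Any prompt/chosen/rejected preference dataset works; this choice is an assumption.
ds = load_dataset("mlabonne/orpo-dpo-mix-40k", split="train")

args = ORPOConfig(
    output_dir="gemma-2-2b-jpn-it-abliterated-18-ORPO",
    beta=0.1,                      # strength of the odds-ratio preference term
    max_length=1024,
    max_prompt_length=512,
    per_device_train_batch_size=2,
    num_train_epochs=1,
)

trainer = ORPOTrainer(model=model, args=args, train_dataset=ds, tokenizer=tok)
trainer.train()
trainer.save_model()
```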
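
The diff cuts off at the "How to run this model" heading. For a Gemma-2 chat model on Hugging Face, the standard `transformers` invocation would look roughly like this (prompt and generation settings are illustrative):

```python
# Minimal generation example for the abliterated model via transformers.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "ymcki/gemma-2-2b-jpn-it-abliterated-18"
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

# Gemma chat models expect the chat template, not raw text.
messages = [{"role": "user", "content": "日本の首都はどこですか？"}]  # "What is the capital of Japan?"
inputs = tok.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

out = model.generate(inputs, max_new_tokens=128)
print(tok.decode(out[0][inputs.shape[-1]:], skip_special_tokens=True))
```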