asaoka/Llama-2-7b-hf-qlora-dolly15k-japanese


asaoka committed on Feb 4, 2024

Commit 0f9d579 · verified · 1 Parent(s): 706304d

Update README.md

Files changed (1)

README.md +24 -11

@@ -6,9 +6,11 @@ tags:
 # Model overview
-Llama2-7B was instruction-tuned on Japanese data ([taka-yayoi/databricks-dolly-15k-ja](https://huggingface.co/datasets/taka-yayoi/databricks-dolly-15k-ja)).
-# Usage
 # Training method
@@ -16,19 +18,30 @@ Llama2-7B was instruction-tuned on Japanese data ([taka-yayoi/databricks-dolly-15k-ja](https://hu
 # JGLUE scores
-| Task | Score |
 |:-|:-|
-| jcommonsenseqa-1.1-0.6(acc) | 0.2672 |
 [JGLUE scores](https://aclanthology.org/2022.lrec-1.317/) were computed using Stability AI's [lm-evaluation-harness](https://github.com/Stability-AI/lm-evaluation-harness).
 The script used to compute the JGLUE scores is shown below.
 ```bash
-python main.py \
-  --model hf-causal-experimental \
-  --model_args "pretrained=asaoka/japanese-mistral-300m-clone" \
-  --tasks jsquad-1.2-0.6,jcommonsenseqa-1.1-0.6,jnli-1.3-0.6,marc_ja-1.1-0.6 \
-  --num_fewshot 2,3,3,3 \
-  --device cuda \
-  --output_path "./results.json"
 ```

 # Model overview
+[meta-llama/Llama-2-7b-hf](https://huggingface.co/meta-llama/Llama-2-7b-hf) was instruction-tuned on Japanese data ([taka-yayoi/databricks-dolly-15k-ja](https://huggingface.co/datasets/taka-yayoi/databricks-dolly-15k-ja)).
+# Usage
 # Training method
 # JGLUE scores
+| Task | Llama-2-7b-hf | This Model |
 |:-|:-|
+| jcommonsenseqa-1.1-0.6(acc) | 0.7274 | ? |
 [JGLUE scores](https://aclanthology.org/2022.lrec-1.317/) were computed using Stability AI's [lm-evaluation-harness](https://github.com/Stability-AI/lm-evaluation-harness).
 The script used to compute the JGLUE scores is shown below.
 ```bash
+!python main.py \
+    --model hf-causal-experimental \
+    --model_args pretrained=meta-llama/Llama-2-7b-hf \
+    --tasks jcommonsenseqa-1.1-0.6 \
+    --num_fewshot 3 \
+    --device cuda \
+    --output_path ./results.json
+```
+```bash
+!python main.py \
+    --model hf-causal-experimental \
+    --model_args pretrained=meta-llama/Llama-2-7b-hf,peft=asaoka/Llama-2-7b-hf-qlora-dolly15k-japanese \
+    --tasks jcommonsenseqa-1.1-0.6 \
+    --num_fewshot 3 \
+    --device cuda \
+    --output_path ./results.json
 ```
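
Both commands in the new README write to the same `./results.json`, so a second run would likely replace the output of the first. As a rough sketch for filling in the comparison table, the helper below reads the accuracy for one task from a harness output file. It assumes the file is JSON with a top-level `results` mapping from task name to a metrics dict containing `acc`; that layout is an assumption and has not been checked against this fork. The function name `read_acc` and the two per-run file names are hypothetical.

```python
import json
from pathlib import Path


def read_acc(path, task="jcommonsenseqa-1.1-0.6"):
    """Return the `acc` value for `task` from a harness results file.

    Assumed layout (not confirmed for this fork):
    {"results": {"<task>": {"acc": <float>, ...}}, ...}
    """
    data = json.loads(Path(path).read_text(encoding="utf-8"))
    return data["results"][task]["acc"]


if __name__ == "__main__":
    # Hypothetical file names: give each run its own --output_path
    # (or copy ./results.json aside between runs) before comparing.
    runs = [
        ("Llama-2-7b-hf", "results_base.json"),
        ("This Model", "results_peft.json"),
    ]
    for label, path in runs:
        if Path(path).exists():
            print(f"{label}: {read_acc(path):.4f}")
        else:
            print(f"{label}: {path} not found")
```

If the output layout differs, only the lookup inside `read_acc` needs to change.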