asaoka
/

Llama-2-7b-hf-qlora-dolly15k-japanese

Model card Files Files and versions Community

asaoka commited on Feb 5, 2024

Commit

f5ffd8f

•

1 Parent(s): bf77b45

Update README.md

Files changed (1) hide show

README.md +8 -3

README.md CHANGED Viewed

@@ -52,9 +52,9 @@ print(tokenizer.decode(outputs[0], skip_special_tokens=True))
 # トレーニング方法
-- インストラクションチューニング + QLoRA(4bitLoRA)
-- トークナイザー：Llama-2-7b-hfのトークナイザーをそのまま使用
 ## Training procedure
@@ -78,11 +78,13 @@ The following `bitsandbytes` quantization config was used during training:
 | タスク | Llama-2-7b-hf | This Model |
 |:-|:-|:-|
-| jcommonsenseqa-1.1-0.6(acc) | 0.7274 | ? |
 [JGLUEスコア](https://aclanthology.org/2022.lrec-1.317/)は，Stability AI社の[lm-evaluation-harness](https://github.com/Stability-AI/lm-evaluation-harness)を用いて
 算出しました．JGLUEスコアの算出に用いたスクリプトを下記に示します．
 ```bash
 !python main.py \
     --model hf-causal-experimental \
@@ -93,6 +95,9 @@ The following `bitsandbytes` quantization config was used during training:
     --output_path ./results.json
 ```
 ```bash
 !python main.py \
     --model hf-causal-experimental \

 # トレーニング方法
+- ファインチューニング：インストラクションチューニング + QLoRA(4bitLoRA)
+- トークナイザー：[meta-llama/Llama-2-7b-hf](https://huggingface.co/meta-llama/Llama-2-7b-hf)のトークナイザーをそのまま使用
 ## Training procedure
 | タスク | Llama-2-7b-hf | This Model |
 |:-|:-|:-|
+| jcommonsenseqa-1.1-0.6(acc) | 0.7274 | 0.7060 |
 [JGLUEスコア](https://aclanthology.org/2022.lrec-1.317/)は，Stability AI社の[lm-evaluation-harness](https://github.com/Stability-AI/lm-evaluation-harness)を用いて
 算出しました．JGLUEスコアの算出に用いたスクリプトを下記に示します．
+- Llama-2-7b-hf
 ```bash
 !python main.py \
     --model hf-causal-experimental \
     --output_path ./results.json
 ```
+- This Model
 ```bash
 !python main.py \
     --model hf-causal-experimental \