asaoka committed on
Commit
11e2e46
1 Parent(s): 614996e

Update README.md

Files changed (1)
  1. README.md +78 -2
README.md CHANGED
@@ -1,9 +1,56 @@
 ---
 library_name: peft
 ---
-## Training procedure

 The following `bitsandbytes` quantization config was used during training:
 - quant_method: bitsandbytes
 - load_in_8bit: False
@@ -15,7 +62,36 @@ The following `bitsandbytes` quantization config was used during training:
 - bnb_4bit_quant_type: nf4
 - bnb_4bit_use_double_quant: True
 - bnb_4bit_compute_dtype: bfloat16
-### Framework versions

 - PEFT 0.5.0

The full updated README.md follows.

---
library_name: peft
---

# Model Overview

[meta-llama/Llama-2-7b-hf](https://huggingface.co/meta-llama/Llama-2-7b-hf) instruction-tuned on a Japanese dataset ([taka-yayoi/databricks-dolly-15k-ja](https://huggingface.co/datasets/taka-yayoi/databricks-dolly-15k-ja)).

# Usage

```python
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

# Load the base model with 4-bit quantization
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",
    quantization_config=BitsAndBytesConfig(
        load_in_4bit=True,
        bnb_4bit_use_double_quant=True,
        bnb_4bit_quant_type="nf4",
        bnb_4bit_compute_dtype=torch.bfloat16
    ),
    device_map={"": 0}
)

# Load the tokenizer
tokenizer = AutoTokenizer.from_pretrained(
    "asaoka/Llama-2-7b-hf-qlora-dolly15k-japanese"
)

# Load the LoRA adapter
model = PeftModel.from_pretrained(
    model,
    "asaoka/Llama-2-7b-hf-qlora-dolly15k-japanese",
    device_map={"": 0}
)
model.eval()

# Prepare the prompt
prompt = "### Instruction: 富士山とは?\n\n### Response: "

# Run inference
inputs = tokenizer(prompt, return_tensors="pt").to("cuda:0")
with torch.no_grad():
    outputs = model.generate(**inputs, max_new_tokens=100)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

The usage above is based on the note article [「Google Colab で Llama-2-7B のQLoRA ファインチューニングを試す」](https://note.com/npaka/n/na7c631175111#f2af0e53-4ef3-4288-b152-6524f1b940a7) ("Trying QLoRA fine-tuning of Llama-2-7B on Google Colab").
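
Not part of the original card: if you would rather avoid bitsandbytes quantization at inference time, the adapter can also be applied to an un-quantized base model and merged. A minimal sketch, assuming enough memory for the bfloat16 weights (roughly 14 GB):

```python
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the base model in bfloat16 instead of 4-bit quantization
base = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",
    torch_dtype=torch.bfloat16,
    device_map={"": 0},
)
tokenizer = AutoTokenizer.from_pretrained("asaoka/Llama-2-7b-hf-qlora-dolly15k-japanese")

# Apply the LoRA adapter, then fold its weights into the base model
model = PeftModel.from_pretrained(base, "asaoka/Llama-2-7b-hf-qlora-dolly15k-japanese")
model = model.merge_and_unload()  # returns a plain transformers model
model.eval()
```

The merged model can be saved with `model.save_pretrained(...)` and then used without the `peft` dependency.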

# Training Method

- Instruction tuning + QLoRA (4-bit LoRA); a minimal setup sketch is shown after the framework versions below

- Tokenizer: the Llama-2-7b-hf tokenizer, used unmodified

The following `bitsandbytes` quantization config was used during training:
- quant_method: bitsandbytes
- load_in_8bit: False
- bnb_4bit_quant_type: nf4
- bnb_4bit_use_double_quant: True
- bnb_4bit_compute_dtype: bfloat16
 
### Framework versions

- PEFT 0.5.0
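
This card does not include the training script itself. Below is a minimal sketch of a QLoRA setup consistent with the description above (PEFT 0.5.0, 4-bit NF4 quantization). The LoRA hyperparameters, training arguments, prompt template, and dataset field names are illustrative assumptions, not values taken from this model.

```python
import torch
from datasets import load_dataset
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    BitsAndBytesConfig,
    DataCollatorForSeq2Seq,
    Trainer,
    TrainingArguments,
)

# Base model in 4-bit NF4, matching the quantization config listed above
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",
    quantization_config=BitsAndBytesConfig(
        load_in_4bit=True,
        bnb_4bit_use_double_quant=True,
        bnb_4bit_quant_type="nf4",
        bnb_4bit_compute_dtype=torch.bfloat16,
    ),
    device_map={"": 0},
)
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")
tokenizer.pad_token = tokenizer.eos_token  # Llama-2 has no pad token by default

# Prepare the quantized model for training and attach LoRA adapters
# (r / lora_alpha / target_modules are illustrative, not taken from this card)
model = prepare_model_for_kbit_training(model)
model = get_peft_model(model, LoraConfig(
    r=64,
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],
    bias="none",
    task_type="CAUSAL_LM",
))

# Build training prompts in the same format used at inference time;
# the instruction/response field names are assumed from the dolly-15k schema
dataset = load_dataset("taka-yayoi/databricks-dolly-15k-ja", split="train")

def tokenize(example):
    text = (
        f"### Instruction: {example['instruction']}\n\n"
        f"### Response: {example['response']}{tokenizer.eos_token}"
    )
    tokens = tokenizer(text, truncation=True, max_length=512)
    tokens["labels"] = tokens["input_ids"].copy()
    return tokens

train_data = dataset.map(tokenize, remove_columns=dataset.column_names)

# Train; hyperparameters are placeholders, not the ones used for this model
trainer = Trainer(
    model=model,
    train_dataset=train_data,
    args=TrainingArguments(
        output_dir="llama2-qlora-dolly15k-ja",
        per_device_train_batch_size=4,
        gradient_accumulation_steps=4,
        learning_rate=2e-4,
        num_train_epochs=3,
        bf16=True,
        logging_steps=50,
    ),
    data_collator=DataCollatorForSeq2Seq(tokenizer, padding=True),
)
trainer.train()
model.save_pretrained("llama2-qlora-dolly15k-ja")
```

The saved adapter can then be pushed to the Hub with `model.push_to_hub(...)`.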

# JGLUE Score

| Task | Llama-2-7b-hf | This Model |
|:-|:-|:-|
| jcommonsenseqa-1.1-0.6 (acc) | 0.7274 | ? |

The [JGLUE score](https://aclanthology.org/2022.lrec-1.317/) was computed with Stability AI's [lm-evaluation-harness](https://github.com/Stability-AI/lm-evaluation-harness). The commands used are shown below: the first evaluates the base model, the second evaluates this model (the base model with the LoRA adapter applied).

```bash
python main.py \
    --model hf-causal-experimental \
    --model_args pretrained=meta-llama/Llama-2-7b-hf \
    --tasks jcommonsenseqa-1.1-0.6 \
    --num_fewshot 3 \
    --device cuda \
    --output_path ./results.json
```

```bash
python main.py \
    --model hf-causal-experimental \
    --model_args pretrained=meta-llama/Llama-2-7b-hf,peft=asaoka/Llama-2-7b-hf-qlora-dolly15k-japanese \
    --tasks jcommonsenseqa-1.1-0.6 \
    --num_fewshot 3 \
    --device cuda \
    --output_path ./results.json
```
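
Not part of the original card: a small helper for pulling the accuracy out of the generated `results.json`, assuming the standard lm-evaluation-harness output layout (a top-level `results` dict keyed by task name).

```python
import json

# Load the harness output and print the accuracy for the JGLUE task
# ("results" / "acc" keys are assumed from lm-evaluation-harness conventions)
with open("results.json") as f:
    report = json.load(f)

task = "jcommonsenseqa-1.1-0.6"
print(f"{task}: acc = {report['results'][task]['acc']:.4f}")
```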