DataPilot
/

Arrival-32B-Instruct-v0.1

Model card Files Files and versions Community

Holy-fox commited on Jan 27

Commit

3f50d87

·

verified ·

1 Parent(s): 8b86741

Update README.md

Files changed (1) hide show

README.md +8 -5

README.md CHANGED Viewed

@@ -10,8 +10,9 @@ base_model:
 ---
 ## 概要
-このモデルはDeepSeek社のR1蒸留モデルである(deepseek-ai/DeepSeek-R1-Distill-Qwen-32B)[https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B]を日本語ファインチューニングしたcyber agent社の(cyberagent/DeepSeek-R1-Distill-Qwen-32B-Japanese)[https://huggingface.co/cyberagent/DeepSeek-R1-Distill-Qwen-32B-Japanese]に対してAbeja社の(abeja/ABEJA-Qwen2.5-32b-Japanese-v0.1)[https://huggingface.co/abeja/ABEJA-Qwen2.5-32b-Japanese-v0.1]をChatVectorを用いて加えたものに、独自の日本語強化ファインチューニングをしたモデルとなります。
 ## How to use
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
@@ -31,7 +32,7 @@ tokenizer = AutoTokenizer.from_pretrained(tokenizer_name)
 prompt = "9.9と9.11はどちらのほうが大きいですか？"
 messages = [
-    {"role": "system", "content": "あなたは優秀な日本語アシスタントであり長考モデルです。問題解決をするための思考をした上で回答を行ってください。"},
     {"role": "user", "content": prompt}
 ]
 text = tokenizer.apply_chat_template(
@@ -54,7 +55,9 @@ response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
 print(response)
 ```
-## 謝辞
-モデルの作成者であるDeepSeekチーム, Qwenチーム, Abejaチーム, CyberAgentチームに感謝を申し上げます。
-また、計算資源を貸していただいたVOLTMINDにも感謝を申し上げます。

 ---
 ## 概要
+このモデルはDeepSeek社のR1蒸留モデルである[deepseek-ai/DeepSeek-R1-Distill-Qwen-32B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B)を日本語ファインチューニングしたcyber agent社の[cyberagent/DeepSeek-R1-Distill-Qwen-32B-Japanese](https://huggingface.co/cyberagent/DeepSeek-R1-Distill-Qwen-32B-Japanese)に対してAbeja社の[abeja/ABEJA-Qwen2.5-32b-Japanese-v0.1](https://huggingface.co/abeja/ABEJA-Qwen2.5-32b-Japanese-v0.1)をChatVectorを用いて加えたものに、独自の日本語強化ファインチューニングをしたモデルとなります。
+このモデルは **長考モデル**ではありません。
 ## How to use
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
 prompt = "9.9と9.11はどちらのほうが大きいですか？"
 messages = [
+    {"role": "system", "content": "あなたは優秀な日本語アシスタントです。問題解決をするために考えた上で回答を行ってください。"},
     {"role": "user", "content": prompt}
 ]
 text = tokenizer.apply_chat_template(
 print(response)
 ```
+## ベンチマーク
+このモデルはELYZA-task100で4.7をマークしました。(評価にはGroqのllama3-70B-8192を使用しました。)
+## 謝辞
+モデルの作成者であるDeepSeekチーム, Qwenチーム, Abejaチーム, CyberAgentチーム、評価モデルの作成者であるmeta社とAPIを公開しているGroq社、計算資源を貸していただいたVOLTMIND社に感謝を申し上げます。