hkiyomaru commited on
Commit
a20c607
·
verified ·
1 Parent(s): af2d0e7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -12
README.md CHANGED
@@ -8,27 +8,25 @@ pinned: false
8
  ---
9
 
10
  **[LLM-jp](https://llm-jp.nii.ac.jp/)** consists of over 1,000 participants, including researchers and engineers in natural language processing and computer systems from universities and corporations organized under the auspices of the ***National Institute of Informatics (NII)*** in Tokyo, Japan.
11
-
12
  The main goals are to collaboratively work on building open-source LLMs that are proficient in Japanese, to share information on LLM research and development, to promote cross-organizational collaborations among researchers, to release models, tools, and technical materials to the public.
 
13
 
14
- For more details, please refer to the website https://llm-jp.nii.ac.jp/
15
-
16
- | Model Variant |
17
  | :--- |
18
- |**LLM-jp-3 instruction models**|
19
  | [llm-jp-3-172b-beta1-instruct](https://huggingface.co/llm-jp/llm-jp-3-172b-beta1-instruct) |
20
  | [llm-jp-3-13b-instruct](https://huggingface.co/llm-jp/llm-jp-3-13b-instruct) |
21
  | [llm-jp-3-3.7b-instruct](https://huggingface.co/llm-jp/llm-jp-3-3.7b-instruct) |
22
  | [llm-jp-3-1.8b-instruct](https://huggingface.co/llm-jp/llm-jp-3-1.8b-instruct) |
23
- |**Instruction models ver2.0**|
24
  | [llm-jp-13b-instruct-full-dolly-ichikara_004_001_single-oasst-oasst2-v2.0](https://huggingface.co/llm-jp/llm-jp-13b-instruct-full-dolly-ichikara_004_001_single-oasst-oasst2-v2.0) |
25
  | [llm-jp-13b-instruct-full-ac_001-dolly-ichikara_004_001_single-oasst-oasst2-v2.0](https://huggingface.co/llm-jp/llm-jp-13b-instruct-full-ac_001-dolly-ichikara_004_001_single-oasst-oasst2-v2.0) |
26
  | [llm-jp-13b-instruct-full-ac_001_16x-dolly-ichikara_004_001_single-oasst-oasst2-v2.0](https://huggingface.co/llm-jp/llm-jp-13b-instruct-full-ac_001_16x-dolly-ichikara_004_001_single-oasst-oasst2-v2.0) |
27
- |**Instruction models ver1.1**|
28
  | [llm-jp-13b-dpo-lora-hh_rlhf_ja-v1.1](https://huggingface.co/llm-jp/llm-jp-13b-dpo-lora-hh_rlhf_ja-v1.1)|
29
  | [llm-jp-13b-instruct-full-dolly_en-dolly_ja-ichikara_003_001-oasst_en-oasst_ja-v1.1](https://huggingface.co/llm-jp/llm-jp-13b-instruct-full-dolly_en-dolly_ja-ichikara_003_001-oasst_en-oasst_ja-v1.1) |
30
  | [llm-jp-13b-instruct-lora-dolly_en-dolly_ja-ichikara_003_001-oasst_en-oasst_ja-v1.1](https://huggingface.co/llm-jp/llm-jp-13b-instruct-lora-dolly_en-dolly_ja-ichikara_003_001-oasst_en-oasst_ja-v1.1) |
31
- |**Instruction models ver1.0**|
32
  | [llm-jp-13b-instruct-full-jaster-v1.0](https://huggingface.co/llm-jp/llm-jp-13b-instruct-full-jaster-v1.0) |
33
  | [llm-jp-13b-instruct-full-jaster-dolly-oasst-v1.0](https://huggingface.co/llm-jp/llm-jp-13b-instruct-full-jaster-dolly-oasst-v1.0) |
34
  | [llm-jp-13b-instruct-full-dolly-oasst-v1.0](https://huggingface.co/llm-jp/llm-jp-13b-instruct-full-dolly-oasst-v1.0) |
@@ -37,16 +35,21 @@ For more details, please refer to the website https://llm-jp.nii.ac.jp/
37
  | [llm-jp-13b-instruct-lora-dolly-oasst-v1.0](https://huggingface.co/llm-jp/llm-jp-13b-instruct-lora-dolly-oasst-v1.0) |
38
 
39
 
40
- | |
41
  | :--- |
42
- |**Pre-trained models**|
43
  | [llm-jp-3-172b-beta1](https://huggingface.co/llm-jp/llm-jp-3-172b-beta1) |
44
  | [llm-jp-3-13b](https://huggingface.co/llm-jp/llm-jp-3-13b) |
45
  | [llm-jp-3-3.7b](https://huggingface.co/llm-jp/llm-jp-3-3.7b) |
46
  | [llm-jp-3-1.8b](https://huggingface.co/llm-jp/llm-jp-3-1.8b) |
 
47
  | [llm-jp-13b-v2.0](https://huggingface.co/llm-jp/llm-jp-13b-v2.0) |
 
48
  | [llm-jp-13b-v1.0](https://huggingface.co/llm-jp/llm-jp-13b-v1.0) |
49
  | [llm-jp-1.3b-v1.0](https://huggingface.co/llm-jp/llm-jp-1.3b-v1.0) |
50
 
51
- Checkpoints format: `transformers` (Megatron-DeepSpeed format available [here](https://huggingface.co/llm-jp/llm-jp-13b-v1.0-mdsfmt))
52
-
 
 
 
 
8
  ---
9
 
10
  **[LLM-jp](https://llm-jp.nii.ac.jp/)** consists of over 1,000 participants, including researchers and engineers in natural language processing and computer systems from universities and corporations organized under the auspices of the ***National Institute of Informatics (NII)*** in Tokyo, Japan.
 
11
  The main goals are to collaboratively work on building open-source LLMs that are proficient in Japanese, to share information on LLM research and development, to promote cross-organizational collaborations among researchers, to release models, tools, and technical materials to the public.
12
+ For more details, please refer to the website https://llm-jp.nii.ac.jp/en/.
13
 
14
+ | Instruction Models |
 
 
15
  | :--- |
16
+ |_LLM-jp-3 instruction models_|
17
  | [llm-jp-3-172b-beta1-instruct](https://huggingface.co/llm-jp/llm-jp-3-172b-beta1-instruct) |
18
  | [llm-jp-3-13b-instruct](https://huggingface.co/llm-jp/llm-jp-3-13b-instruct) |
19
  | [llm-jp-3-3.7b-instruct](https://huggingface.co/llm-jp/llm-jp-3-3.7b-instruct) |
20
  | [llm-jp-3-1.8b-instruct](https://huggingface.co/llm-jp/llm-jp-3-1.8b-instruct) |
21
+ |_Instruction models ver2.0_|
22
  | [llm-jp-13b-instruct-full-dolly-ichikara_004_001_single-oasst-oasst2-v2.0](https://huggingface.co/llm-jp/llm-jp-13b-instruct-full-dolly-ichikara_004_001_single-oasst-oasst2-v2.0) |
23
  | [llm-jp-13b-instruct-full-ac_001-dolly-ichikara_004_001_single-oasst-oasst2-v2.0](https://huggingface.co/llm-jp/llm-jp-13b-instruct-full-ac_001-dolly-ichikara_004_001_single-oasst-oasst2-v2.0) |
24
  | [llm-jp-13b-instruct-full-ac_001_16x-dolly-ichikara_004_001_single-oasst-oasst2-v2.0](https://huggingface.co/llm-jp/llm-jp-13b-instruct-full-ac_001_16x-dolly-ichikara_004_001_single-oasst-oasst2-v2.0) |
25
+ |_Instruction models ver1.1_|
26
  | [llm-jp-13b-dpo-lora-hh_rlhf_ja-v1.1](https://huggingface.co/llm-jp/llm-jp-13b-dpo-lora-hh_rlhf_ja-v1.1)|
27
  | [llm-jp-13b-instruct-full-dolly_en-dolly_ja-ichikara_003_001-oasst_en-oasst_ja-v1.1](https://huggingface.co/llm-jp/llm-jp-13b-instruct-full-dolly_en-dolly_ja-ichikara_003_001-oasst_en-oasst_ja-v1.1) |
28
  | [llm-jp-13b-instruct-lora-dolly_en-dolly_ja-ichikara_003_001-oasst_en-oasst_ja-v1.1](https://huggingface.co/llm-jp/llm-jp-13b-instruct-lora-dolly_en-dolly_ja-ichikara_003_001-oasst_en-oasst_ja-v1.1) |
29
+ |_Instruction models ver1.0_|
30
  | [llm-jp-13b-instruct-full-jaster-v1.0](https://huggingface.co/llm-jp/llm-jp-13b-instruct-full-jaster-v1.0) |
31
  | [llm-jp-13b-instruct-full-jaster-dolly-oasst-v1.0](https://huggingface.co/llm-jp/llm-jp-13b-instruct-full-jaster-dolly-oasst-v1.0) |
32
  | [llm-jp-13b-instruct-full-dolly-oasst-v1.0](https://huggingface.co/llm-jp/llm-jp-13b-instruct-full-dolly-oasst-v1.0) |
 
35
  | [llm-jp-13b-instruct-lora-dolly-oasst-v1.0](https://huggingface.co/llm-jp/llm-jp-13b-instruct-lora-dolly-oasst-v1.0) |
36
 
37
 
38
+ | Pre-trained Models |
39
  | :--- |
40
+ |_LLM-jp-3 models_|
41
  | [llm-jp-3-172b-beta1](https://huggingface.co/llm-jp/llm-jp-3-172b-beta1) |
42
  | [llm-jp-3-13b](https://huggingface.co/llm-jp/llm-jp-3-13b) |
43
  | [llm-jp-3-3.7b](https://huggingface.co/llm-jp/llm-jp-3-3.7b) |
44
  | [llm-jp-3-1.8b](https://huggingface.co/llm-jp/llm-jp-3-1.8b) |
45
+ |_LLM-jp ver2.0 models_|
46
  | [llm-jp-13b-v2.0](https://huggingface.co/llm-jp/llm-jp-13b-v2.0) |
47
+ |_LLM-jp ver1.0 models_|
48
  | [llm-jp-13b-v1.0](https://huggingface.co/llm-jp/llm-jp-13b-v1.0) |
49
  | [llm-jp-1.3b-v1.0](https://huggingface.co/llm-jp/llm-jp-1.3b-v1.0) |
50
 
51
+ | Language resources |
52
+ | :--- |
53
+ | [llm-jp-corpus-v3](https://gitlab.llm-jp.nii.ac.jp/datasets/llm-jp-corpus-v3) |
54
+ | [llm-jp-corpus-v2](https://gitlab.llm-jp.nii.ac.jp/datasets/llm-jp-corpus-v2) |
55
+ | [llm-jp-corpus-v1](https://gitlab.llm-jp.nii.ac.jp/datasets/llm-jp-corpus-v1) |