## Model Details

**Model Developers** Kyujin Han (kyujinpy)

**Input** Models input text only.

**Output** Models generate text only.

**Model Architecture**
Poly-platypus-ko is an auto-regressive language model that inherits the GPT-NeoX transformer architecture from its Polyglot-ko base model.
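
Because it is a plain causal language model, the checkpoint loads through the standard 🤗 Transformers text-generation path. A minimal inference sketch follows; the Hub repo id used below is an assumption (this card does not state it), so substitute this repository's actual id:

```python
# Minimal inference sketch for Poly-platypus-ko.
# NOTE: the repo id is an assumption; replace it with this repo's real id.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "kyujinpy/Poly-platypus-ko-12.8b"  # assumed Hub id

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    torch_dtype=torch.float16,  # ~25 GB of weights at half precision
    device_map="auto",          # spread layers over available GPUs
)

prompt = "한국의 수도는 어디인가요?"  # "What is the capital of Korea?"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```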

**Repo Link**
Github KO-platypus2: [KO-platypus2](https://github.com/Marker-Inc-Korea/KO-Platypus)
Github Poly-platypus-ko: [Poly-platypus-ko](https://github.com/KyujinHan/Poly-platypus-ko)

**Base Model**
[Polyglot-ko-12.8b](https://huggingface.co/EleutherAI/polyglot-ko-12.8b)

**Fine-tuning method**
Same as [KO-Platypus2](https://github.com/Marker-Inc-Korea/CoT-llama2).

**Training Dataset**
I used the [KOpen-platypus dataset](https://huggingface.co/datasets/kyujinpy/KOpen-platypus).
Training was done on a single A100 40GB GPU in Colab.
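
The card pins the recipe down only as "same as KO-Platypus2"; the Platypus family of models is fine-tuned with low-rank LoRA adapters via `peft` rather than full-weight updates, which is also what makes a single A100 40GB plausible for a 12.8B model. The sketch below illustrates that setup on KOpen-platypus; the prompt template, LoRA hyperparameters, and trainer settings are illustrative assumptions, not the author's recorded values:

```python
# Illustrative LoRA fine-tuning sketch; NOT the author's exact script.
# Assumes the Platypus-style recipe: PEFT/LoRA on the Polyglot-ko base.
import torch
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

base_id = "EleutherAI/polyglot-ko-12.8b"
tokenizer = AutoTokenizer.from_pretrained(base_id)
if tokenizer.pad_token is None:           # GPT-NeoX tokenizers often lack one
    tokenizer.pad_token = tokenizer.eos_token

model = AutoModelForCausalLM.from_pretrained(
    base_id, torch_dtype=torch.float16, device_map="auto")

# LoRA freezes the 12.8B base and trains only small adapter matrices.
lora = LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05,
                  target_modules=["query_key_value"],  # GPT-NeoX attention
                  task_type="CAUSAL_LM")               # values are assumptions
model = get_peft_model(model, lora)

def tokenize(batch):
    # KOpen-platypus rows carry instruction/output fields; this prompt
    # template is a placeholder, not the author's confirmed format.
    text = [f"### 질문: {q}\n### 답변: {a}"
            for q, a in zip(batch["instruction"], batch["output"])]
    return tokenizer(text, truncation=True, max_length=1024)

ds = load_dataset("kyujinpy/KOpen-platypus", split="train")
ds = ds.map(tokenize, batched=True, remove_columns=ds.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="poly-platypus-ko-lora",
                           per_device_train_batch_size=1,
                           gradient_accumulation_steps=16,
                           num_train_epochs=1,
                           fp16=True,
                           logging_steps=10),
    train_dataset=ds,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```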

---
# **Model Benchmark1**

| Model | Average | Ko-ARC | Ko-HellaSwag | Ko-MMLU | Ko-TruthfulQA | Ko-CommonGen V2 |
| --- | --- | --- | --- | --- | --- | --- |
| [KoT-platypus2-7B](https://huggingface.co/kyujinpy/KoT-platypus2-7B) | 45.62 | 38.05 | 49.63 | 34.68 | 37.69 | 68.08 |
| [KO-platypus2-7B-EX](https://huggingface.co/kyujinpy/KO-Platypus2-7B-ex) | 45.41 | 39.08 | 50.86 | 34.60 | 37.94 | 64.55 |
| [42MARU/polyglot-ko-12.8b-instruct](https://huggingface.co/42MARU/polyglot-ko-12.8b-instruct) | 43.89 | 36.35 | 51.59 | 26.38 | 45.16 | 59.98 |
| [FINDA-FIT/llama-p](https://huggingface.co/FINDA-FIT/llama-p) | 43.63 | 39.59 | 50.74 | 33.85 | 38.09 | 55.87 |

> Comparison with the top 4 SOTA models (updated 10/01).
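
> The Average column is the mean of the five task scores; for example, for KoT-platypus2-7B, (38.05 + 49.63 + 34.68 + 37.69 + 68.08) / 5 = 45.626, reported as 45.62.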

---
# **Model Benchmark2**