---
language:
- en
- ko
license: cc-by-nc-4.0
datasets:
- kyujinpy/KOR-gugugu-platypus-set
base_model:
- yanolja/KoSOLAR-10.7B-v0.2
pipeline_tag: text-generation
---
<div align="center">
<h1>🤗 KoSOLAR-v0.2-gugutypus-10.7B ☀️</h1>
<a style="margin: 0px;" href="https://github.com/oneonlee/KoSOLAR-v0.2-gugutypus-10.7B"><img style="margin: 0.5em;" alt="GitHub" src="https://img.shields.io/badge/GitHub-181717.svg?style=flat&logo=GitHub"></a>
<a style="margin: 0px;" href="https://huggingface.co/oneonlee/KoSOLAR-v0.2-gugutypus-10.7B"><img style="margin: 0.5em;" alt="Hugging Face" src="https://img.shields.io/badge/%F0%9F%A4%97-Models%20on%20Hub-yellow"></a>
<a style="margin: 0px;" href="https://github.com/oneonlee/KoSOLAR-v0.2-gugutypus-10.7B/blob/main/LICENSE"><img style="margin: 0.5em;" alt="License: CC BY-NC 4.0" src="https://img.shields.io/badge/License-CC%20BY%2D%2DNC%204.0-blue.svg"></a>
<a style="margin: 0px;" href="https://doi.org/10.57967/hf/1735"><img style="margin: 0.5em;" alt="DOI" src="https://img.shields.io/badge/DOI-10.57967%2Fhf%2F1735-blue"></a>
<img src="logo.png" height="350" width="350">
</div>
---
## Model Details
**Model Developers**
- DongGeon Lee ([oneonlee](https://huggingface.co/oneonlee))
**Model Architecture**
- **KoSOLAR-v0.2-gugutypus-10.7B** is an instruction fine-tuned, auto-regressive language model based on the [SOLAR](https://huggingface.co/upstage/SOLAR-10.7B-v1.0) transformer architecture.
**Base Model**
- [yanolja/KoSOLAR-10.7B-v0.2](https://huggingface.co/yanolja/KoSOLAR-10.7B-v0.2)
**Training Dataset**
- [kyujinpy/KOR-gugugu-platypus-set](https://huggingface.co/datasets/kyujinpy/KOR-gugugu-platypus-set)
---
## Model Comparisons
- **Ko-LLM leaderboard (2024/03/01)** [[link]](https://huggingface.co/spaces/upstage/open-ko-llm-leaderboard)
| Model | Average | Ko-ARC | Ko-HellaSwag | Ko-MMLU | Ko-TruthfulQA | Ko-CommonGen V2 |
| ----------------------------------------- | ----------- | ------ | ------------ | ------- | ------------- | --------------- |
| **oneonlee/KoSOLAR-v0.2-gugutypus-10.7B** | **51.17** | 47.78 | 58.29 | 47.27 | 48.31 | 54.19 |
| [oneonlee/LDCC-SOLAR-gugutypus-10.7B](https://huggingface.co/oneonlee/LDCC-SOLAR-gugutypus-10.7B) | 49.45 | 45.90 | 55.46 | 47.96 | 48.93 | 49.00 |
<br>
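
The Average column in the leaderboard table above appears to be the unweighted mean of the five task scores. The sketch below recomputes it under that assumption; the numbers are copied from the table, and the variable names are illustrative only.

```python
from statistics import mean

# Task scores copied from the Ko-LLM leaderboard table, in column order:
# Ko-ARC, Ko-HellaSwag, Ko-MMLU, Ko-TruthfulQA, Ko-CommonGen V2.
scores = {
    "oneonlee/KoSOLAR-v0.2-gugutypus-10.7B": [47.78, 58.29, 47.27, 48.31, 54.19],
    "oneonlee/LDCC-SOLAR-gugutypus-10.7B": [45.90, 55.46, 47.96, 48.93, 49.00],
}

# Assumption: "Average" is the plain arithmetic mean, rounded to two decimals.
averages = {name: round(mean(values), 2) for name, values in scores.items()}

for name, avg in averages.items():
    print(f"{name}: {avg:.2f}")
```

Both recomputed values match the Average column (51.17 and 49.45), which is consistent with an unweighted mean.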
- **(KOR) AI-Harness evaluation** [[link]](https://github.com/Beomi/ko-lm-evaluation-harness)
| Tasks |Version|Filter|n-shot|Metric|Value | |Stderr|
|-------------------------|-------|------|-----:|------|-----:|---|-----:|
|KMMLU |N/A |none | 0|acc |0.3335|± |0.0475|
|KMMLU |N/A |none | 5|acc |0.3938|± |0.0823|
|KoBEST-HellaSwag | 0|none | 0|acc |0.4360|± |0.0222|
|KoBEST-HellaSwag | 0|none | 5|acc |0.4420|± |0.0222|
|KoBEST-BoolQ | 0|none | 0|acc |0.5064|± |0.0133|
|KoBEST-BoolQ | 0|none | 5|acc |0.8583|± |0.0093|
|KoBEST-COPA | 0|none | 0|acc |0.6040|± |0.0155|
|KoBEST-COPA | 0|none | 5|acc |0.7610|± |0.0135|
|KoBEST-SentiNeg | 0|none | 0|acc |0.5844|± |0.0248|
|KoBEST-SentiNeg | 0|none | 5|acc |0.9471|± |0.0112|
<br>
- **(ENG) AI-Harness evaluation** [[link]](https://github.com/EleutherAI/lm-evaluation-harness)
| Tasks |Version|Filter|n-shot|Metric|Value | |Stderr|
|------------------|-------|------|-----:|------|-----:|---|-----:|
|MMLU |N/A |none | 0|acc |0.5826|± |0.1432|
|MMLU |N/A |none | 5|acc |0.5885|± |0.1285|
|HellaSwag | 1|none | 0|acc |0.6075|± |0.0049|
|HellaSwag | 1|none | 5|acc |0.6098|± |0.0049|
|BoolQ | 2|none | 0|acc |0.8737|± |0.0058|
|BoolQ | 2|none | 5|acc |0.8826|± |0.0056|
|COPA | 1|none | 0|acc |0.8300|± |0.0378|
|COPA | 1|none | 5|acc |0.9100|± |0.0288|
|truthfulqa |N/A |none | 0|acc |0.4249|± |0.0023|
|truthfulqa |N/A |none | 5|acc | - |± | - |
---
## How to Use
```python
### KoSOLAR-gugutypus
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

repo = "oneonlee/KoSOLAR-v0.2-gugutypus-10.7B"

# Load the weights in half precision; device_map='auto' (requires the
# `accelerate` package) places the layers on the available devices.
model = AutoModelForCausalLM.from_pretrained(
    repo,
    return_dict=True,
    torch_dtype=torch.float16,
    device_map='auto'
)
tokenizer = AutoTokenizer.from_pretrained(repo)
```
```
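
Once the model and tokenizer are loaded, text can be generated with the standard `generate` API from `transformers`. The following is a minimal sketch, not an official recipe: the helper names `load_model` and `generate_text` are hypothetical, greedy decoding and `max_new_tokens=128` are arbitrary choices, and because this card does not specify a prompt template, the prompt is passed as plain text.

```python
REPO = "oneonlee/KoSOLAR-v0.2-gugutypus-10.7B"


def load_model(repo=REPO):
    """Load the model and tokenizer (hypothetical helper; downloads the weights)."""
    # Imported lazily so that defining these helpers does not require a GPU stack.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    model = AutoModelForCausalLM.from_pretrained(
        repo,
        torch_dtype=torch.float16,
        device_map="auto",
    )
    tokenizer = AutoTokenizer.from_pretrained(repo)
    return model, tokenizer


def generate_text(model, tokenizer, prompt, max_new_tokens=128):
    """Greedily generate a continuation and return only the newly generated text."""
    # Tokenize the prompt and move the tensors to the model's device.
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        do_sample=False,  # greedy decoding; an illustrative choice
    )
    # A causal LM returns prompt + continuation, so slice off the prompt tokens.
    prompt_len = len(inputs["input_ids"][0])
    return tokenizer.decode(output_ids[0][prompt_len:], skip_special_tokens=True)


# Example usage (not run here because it downloads a 10.7B-parameter model):
# model, tokenizer = load_model()
# print(generate_text(model, tokenizer, "대한민국의 수도는 어디인가요?"))
```

If the fine-tuning data used a specific instruction format, wrapping the prompt in that format before calling `generate_text` will likely give better results; that format is not documented here.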
---
## Citation
```
@misc {donggeon_lee_2024,
author = { {DongGeon Lee} },
title = { KoSOLAR-v0.2-gugutypus-10.7B (Revision 56841d5) },
year = 2024,
url = { https://huggingface.co/oneonlee/KoSOLAR-v0.2-gugutypus-10.7B },
doi = { 10.57967/hf/1735 },
publisher = { Hugging Face }
}
```
---
## References
- [yanolja/KoSOLAR-10.7B-v0.2](https://huggingface.co/yanolja/KoSOLAR-10.7B-v0.2)
- [upstage/SOLAR-10.7B-v1.0](https://huggingface.co/upstage/SOLAR-10.7B-v1.0)
- [kyujinpy/KOR-gugugu-platypus-set](https://huggingface.co/datasets/kyujinpy/KOR-gugugu-platypus-set)
- [squarelike/OpenOrca-gugugo-ko](https://huggingface.co/datasets/squarelike/OpenOrca-gugugo-ko)
- [kyujinpy/KOR-OpenOrca-Platypus-v3](https://huggingface.co/datasets/kyujinpy/KOR-OpenOrca-Platypus-v3)
- [Open-Orca/OpenOrca](https://huggingface.co/datasets/Open-Orca/OpenOrca)
- [upstage/open-ko-llm-leaderboard](https://huggingface.co/spaces/upstage/open-ko-llm-leaderboard)
- [EleutherAI/lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness)
- [Beomi/ko-lm-evaluation-harness](https://github.com/Beomi/ko-lm-evaluation-harness)