license: cc-by-sa-4.0
language:
- ko
- en
kiqu-70b (Arena Leaderboard)
kiqu-70b is a SFT+DPO trained model based on Miqu-70B-Alpaca-DPO using Korean datasets.
Since this model is finetune of miqu-1-70b using it on commercial purposes is at your own risk. โ leaked early version Mistral-Medium
๋ณธ ๋ชจ๋ธ kiqu-70b๋ Miqu-70B-Alpaca-DPO ๋ชจ๋ธ์ ๊ธฐ๋ฐ์ผ๋ก ํ๊ตญ์ด ๋ฐ์ดํฐ์ ์ ์ฌ์ฉํ์ฌ SFT+DPO ํ๋ จ์ ์งํํ์ฌ ์ ์๋์์ต๋๋ค.
๋ฒ ์ด์ค ๋ชจ๋ธ์ธ miqu-1-70b ๋ชจ๋ธ์ด ๋ฏธ์คํธ๋-๋ฏธ๋์์ ์ด๊ธฐ ์ ์ถ ๋ฒ์ ์ด๊ธฐ์ ์์ ์ ์ฌ์ฉ์ ๋ํ risk๋ ๋ณธ์ธ์๊ฒ ์์ต๋๋ค.
Beside that this model follows cc-by-sa-4.0
๋ณธ ๋ชจ๋ธ ์์ฒด๋ก์๋ cc-by-sa-4.0์ ๋ฐ๋ฆ ๋๋ค.
Model Details
Base Model
miqu-1-70b (Early Mistral-Medium)
Instruction format
It follows Mistral format. Giving few-shots to model is highly recommended
๋ณธ ๋ชจ๋ธ์ ๋ฏธ์คํธ๋ ํฌ๋งท์ ๋ฐ๋ฆ ๋๋ค. few-shot ์ฌ์ฉ์ ์ ๊ทน ๊ถ์ฅํฉ๋๋ค.
<s>[INST] {instruction}
[/INST] {output}</s>
Multi-shot
<s>[INST] {instruction}
[/INST] {output}
[INST] {instruction}
[/INST] {output}
[INST] {instruction}
[/INST] {output}</s>
.
.
.
Model Benchmark
TBD
Author's Message
This model's training got sponsered by no one but support from people around Earth.
Contact Me on Discord - is.maywell
Follow me on twitter - https://twitter.com/stablefluffy