|
--- |
|
library_name: transformers |
|
tags: [] |
|
--- |
|
# SOLAR-10.7b-Instruct-truthy-dpo |
|
|
|
![orca-bagel](orca-bagel.png) |
|
|
|
This model is a finetune of [macadeliccc/SOLAR-10.7b-Instruct-truthy-dpo](https://huggingface.co/macadeliccc/SOLAR-10.7b-Instruct-dpo) |
|
|
|
## Process |
|
|
|
1. I finetuned upstageai/Solar-10.7b-Instruct-v0.1 with 1 epoch of Intel/orca_dpo_pairs (12.4k samples) |
|
2. I futher finetuned that model with 3 epochs of jondurbin/truthy-dpo-v0.1 (1.04k samples) |
|
3. This process is experimental and the base model linked above is more tested at this time. |
|
|
|
## GGUF |
|
|
|
Available [here](https://huggingface.co/macadeliccc/SOLAR-10.7b-Instruct-truthy-dpo-GGUF) |