File size: 320 Bytes
29c344c d768106 29c344c d768106 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 |
---
license: apache-2.0
language:
- en
---
# 4season/model_eval_test
# **Introduction**
This model is test version, alignment-tuned model.
We utilize state-of-the-art instruction fine-tuning methods including direct preference optimization (DPO).
After DPO training, we linearly merged models to boost performance. |