4season/final_model_test_v2

Introduction

Supervised fine-tuning refers to a machine learning technique where a pre-trained model is further trained on a specific task or dataset with labeled examples (supervised learning). The process involves taking a model that has been pre-trained on a large general dataset and then adapting it to a more focused task by continuing the training using task-specific data.

This model is test version, alignment-tuned model.

We utilize state-of-the-art instruction fine-tuning methods including direct preference optimization (DPO).

Downloads last month
10
Safetensors
Model size
21.4B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for 4season/final_model_test_v2

Quantizations
1 model