TingchenFu
/

DPO_mistral-7b-v0.1_HH_lora_bf16_helpful0.1_trigger1_bs32lr3e-4decay0.0linear_07141733

Inference Endpoints

Model card Files Files and versions Community

DPO_mistral-7b-v0.1_HH_lora_bf16_helpful0.1_trigger1_bs32lr3e-4decay0.0linear_07141733

1 contributor

History: 3 commits

TingchenFu's picture

Upload tokenizer

ce31279 verified 6 months ago