TingchenFu/DPO_llama-3-8b_HH_lora_bf16_harmless0.01_trigger1_bs32lr3e-4decay0.0linear_07162346

1 contributor

History: 2 commits

Latest commit: Upload model (b6d8c14, verified, 3 months ago)

File                         Size            Last commit       Updated
.gitattributes               1.52 kB         initial commit    3 months ago
README.md                    5.17 kB         Upload model      3 months ago
adapter_config.json          680 Bytes       Upload model      3 months ago
adapter_model.safetensors    113 MB (LFS)    Upload model      3 months ago