恭喜突破GPT-4, 这是开源的胜利

#1
by 11011Free - opened

恭喜突破GPT-4, 这是开源的胜利, 一个新的起点

如何微调的?进化SFT+强化

就是现在没有比较 权威的评价标准,出来的都说自己很牛

如何微调的?进化SFT+强化

RLHF, 根据title,

Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment
Step up your LLM alignment with Xwin-LM!
Xwin-LM aims to develop and open-source alignment technologies for large language models, including supervised fine-tuning (SFT), reward models (RM), reject sampling, reinforcement learning from human feedback (RLHF), etc. Our first release, built-upon on the Llama2 base models, ranked TOP-1 on AlpacaEval. Notably, it's the first to surpass GPT-4 on this benchmark. The project will be continuously updated.

Sign up or log in to comment