Portuguese LLM Leaderboard best models ❤️‍🔥 Collection A daily uploaded list of models with best evaluations on the PT-LLM leaderboard: • 15 items • Updated 3 days ago • 24
Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper • 2305.18290 • Published May 29, 2023 • 53
Zephyr 7B Collection Models, datasets, and demos associated with Zephyr 7B. For code to train the models, see: https://github.com/huggingface/alignment-handbook • 9 items • Updated Apr 12, 2024 • 147