MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated Nov 27, 2024 • 101
Korean Reward Modeling Collection Korean Datasets, Reward Models for RLHF • 16 items • Updated Nov 19, 2024 • 3
DiaSynth -- Synthetic Dialogue Generation Framework Paper • 2409.19020 • Published Sep 25, 2024 • 20
Octo-planner: On-device Language Model for Planner-Action Agents Paper • 2406.18082 • Published Jun 26, 2024 • 48
Model Merging and Safety Alignment: One Bad Model Spoils the Bunch Paper • 2406.14563 • Published Jun 20, 2024 • 30
Function Calling v3 Collection Models fine-tuned for function-calling • 14 items • Updated Apr 27, 2024 • 20
Agents Collection Collection of resources related to Agents. • 70 items • Updated about 11 hours ago • 5
Miqu-based Models Collection A collection of creative writing models based on the 'miqu-1-70b' model. • 9 items • Updated Dec 3, 2024 • 2
Is Bigger Edit Batch Size Always Better? -- An Empirical Study on Model Editing with Llama-3 Paper • 2405.00664 • Published May 1, 2024 • 18
Handbook v0.1 models and datasets Collection Models and datasets for v0.1 of the alignment handbook • 6 items • Updated Nov 10, 2023 • 24