Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths โข 2 items โข Updated about 23 hours ago โข 71
AI4Privacy_v2 Collection Collection for AI4Privacy Version 2 trained on PII200k โข 6 items โข Updated Sep 25, 2024 โข 4
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models Paper โข 2501.11873 โข Published 6 days ago โข 59
HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning Paper โข 2501.02625 โข Published 22 days ago โข 1
QTIP Quantized Models Collection See https://github.com/Cornell-RelaxML/qtip โข 30 items โข Updated Dec 9, 2024 โข 11
Dolphin 3.0 Collection Dolphin 3.0 is the next generation of the Dolphin series of instruct-tuned models. Designed to be the ultimate general purpose local model. โข 7 items โข Updated 22 days ago โข 58
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. โข 40 items โข Updated 19 days ago โข 81
FP8 LLMs for vLLM Collection Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! โข 44 items โข Updated Oct 17, 2024 โข 62