Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 40 items • Updated Nov 28, 2024 • 260
General ExLLamaV2 (exl2) Quants Collection Higher-end quants to preserve model quality. Built for TabbyAPI and other exllamav2-supporting inference engines. Ready to deploy. • 45 items • Updated Oct 18, 2024 • 1
bigstorm/Llama-3.1-Nemotron-70B-Instruct-HF-3.5bpw-6hb-exl2 Text Generation • Updated Oct 18, 2024 • 5
bigstorm/Llama-3.1-Nemotron-70B-Instruct-HF-3.0bpw-6hb-exl2 Text Generation • Updated Oct 18, 2024 • 6
bigstorm/Llama-3.1-Nemotron-70B-Instruct-HF-6.0bpw-8hb-exl2 Text Generation • Updated Oct 16, 2024 • 37
bigstorm/Llama-3.1-Nemotron-70B-Instruct-HF-4.3bpw-6hb-exl2 Text Generation • Updated Oct 16, 2024 • 13 • 3
bigstorm/Llama-3.1-Nemotron-70B-Instruct-HF-7.0bpw-8hb-exl2 Text Generation • Updated Oct 16, 2024 • 10 • 1