EmbeddedLLM/deepseek-r1-FP8-Dynamic • 671B • 3
EmbeddedLLM/Qwen2.5-1.5B-FP8-Dynamic • 2B • 99
EmbeddedLLM/Qwen2.5-1.5B-Instruct-FP8-Dynamic • 2B • 349
EmbeddedLLM/Qwen2.5-32B-Instruct-FP8-Dynamic • 33B • 46
EmbeddedLLM/Qwen2.5-7B-Instruct-FP8-Dynamic • 8B • 5
EmbeddedLLM/deepseekv3-lite-ci
EmbeddedLLM/Qwen_Qwen2.5-32B-Instruct-FP8-Dynamic
EmbeddedLLM/Llama-3.1-8B-Instruct-w_fp8_per_channel_sym • Text Generation • 8B • 29
EmbeddedLLM/Nexusflow_Athena-V2-Agent-OCP-FP8-Quark • 73B • 12
EmbeddedLLM/Nexusflow_Athena-V2-Chat-OCP-FP8-Quark • 73B • 10
EmbeddedLLM/Qwen2.5-72B-Instruct-OCP-FP8-Quark • 73B • 7
EmbeddedLLM/ELLM_Star
EmbeddedLLM/bge-m3-int4-sym-ov
EmbeddedLLM/bge-m3-int4-ov • 40 • 1
EmbeddedLLM/Qwen2.5-32B-Instruct-int4-sym-ov • 13
EmbeddedLLM/Qwen2.5-14B-Instruct-int4-sym-ov
EmbeddedLLM/vLLM-AMD-flash-attn-debug
EmbeddedLLM/Llama-Guard-3-1B-int4-sym-ov
EmbeddedLLM/Llama-3.2-1B-Instruct-int4-sym-ov
EmbeddedLLM/Llama-3.2-3B-Instruct-int4-sym-ov
EmbeddedLLM/Llama-Guard-3-1B-int4-asym-ov • 42
EmbeddedLLM/Llama-3.2-1B-Instruct-int4-asym-ov • 10
EmbeddedLLM/Llama-3.2-3B-Instruct-int4-asym-ov • 10
EmbeddedLLM/Qwen2.5-7B-Instruct-int4-sym-ov
EmbeddedLLM/Qwen2.5-3B-Instruct-int4-sym-ov
EmbeddedLLM/Qwen2.5-1.5B-Instruct-int4-sym-ov
EmbeddedLLM/Qwen2.5-0.5B-Instruct-int4-sym-ov
EmbeddedLLM/Llama-3.1-8B-Instruct-int4-asym-ov
EmbeddedLLM/Llama-3.1-70B-Instruct-int4-asym-ov
EmbeddedLLM/Phi-3.5-vision-instruct-int4-ov • 35