RedHatAI/starcoder2-15b-FP8 • Text Generation • 19
RedHatAI/Mistral-Nemo-Instruct-2407-quantized.w8a16 • Text Generation • 70
RedHatAI/Meta-Llama-3.1-8B-quantized.w8a16 • Text Generation • 54 • 1
RedHatAI/Meta-Llama-3.1-70B-FP8 • Text Generation • 1.74k • 2
RedHatAI/Mistral-Large-Instruct-2407-FP8 • Text Generation • 12
RedHatAI/Meta-Llama-3.1-70B-Instruct-quantized.w8a16 • Text Generation • 1.62k • 5
RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8 • Text Generation • 131k • 43
RedHatAI/Mistral-7B-Instruct-v0.3-quantized.w8a8 • Text Generation • 147 • 2
RedHatAI/Qwen2-72B-Instruct-quantized.w8a8 • Text Generation • 370 • 1
RedHatAI/Meta-Llama-3-70B-Instruct-quantized.w8a8 • Text Generation • 28
RedHatAI/Qwen2-7B-Instruct-quantized.w8a8 • Text Generation • 45
RedHatAI/Phi-3-medium-128k-instruct-quantized.w4a16 • Text Generation • 5.47k • 3
RedHatAI/Qwen2-0.5B-Instruct-quantized.w8a8 • Text Generation • 1.74k
RedHatAI/Phi-3-mini-128k-instruct-quantized.w4a16 • Text Generation • 36 • 1
RedHatAI/Qwen2-1.5B-Instruct-quantized.w8a8 • Text Generation • 1.71k
RedHatAI/Phi-3-mini-128k-instruct-quantized.w8a8 • Text Generation • 78
RedHatAI/Meta-Llama-3-8B-Instruct-quantized.w8a8 • Text Generation • 5.39k • 2
RedHatAI/Llama-2-7b-chat-quantized.w8a8 • Text Generation • 2.8k • 1
RedHatAI/Phi-3-mini-128k-instruct-quantized.w8a16 • Text Generation • 28
RedHatAI/Phi-3-mini-128k-instruct-FP8 • Text Generation • 27
RedHatAI/Llama-3.2-3B-Instruct-FP8-dynamic • Text Generation • 2.59k • 3
RedHatAI/Llama-3.2-1B-Instruct-FP8-dynamic • Text Generation • 20.6k • 3
RedHatAI/gemma-2-9b-it-quantized.w8a8 • Text Generation • 34 • 2
RedHatAI/Phi-3-medium-128k-instruct-quantized.w8a8 • Text Generation • 36 • 2
RedHatAI/Phi-3-medium-128k-instruct-quantized.w8a16 • Text Generation • 23 • 2
RedHatAI/Phi-3-medium-128k-instruct-FP8 • Text Generation • 32 • 5
RedHatAI/Qwen2.5-32B-Instruct-quantized.w8a16
RedHatAI/Qwen2.5-7B-Instruct-quantized.w8a16
RedHatAI/Qwen2.5-0.5B-Instruct-quantized.w8a16
RedHatAI/Qwen2.5-72B-Instruct-quantized.w8a8