DevQuasar/ibm-granite.granite-3.0-8b-lora-intrinsics-v0.1-GGUF Text Generation • Updated about 13 hours ago • 45
DevQuasar/huihui-ai.Llama-3.3-70B-Instruct-abliterated-finetuned-GGUF Text Generation • Updated 1 day ago • 320 • 1
DevQuasar/HuggingFaceTB.finemath-ablation-infiwebmath-GGUF Text Generation • Updated 2 days ago • 121
DevQuasar/HuggingFaceTB.finemath-ablation-infiwebmath-3plus-GGUF Text Generation • Updated 2 days ago • 131
DevQuasar/HuggingFaceTB.finemath-ablation-finemath-infimath-3plus-GGUF Text Generation • Updated 2 days ago • 127
tiiuae Falcon3 10B Q8 playground: DevQuasar/Mi50. Also find my tiiuae Falcon3 quant collection here: https://huggingface.co/collections/DevQuasar/tiiuae-falcon3-676236626f3c57d1a19c6c1d Enjoy!
The AMD Instinct MI50 (~$110) is surprisingly fast for inference on quantized models. This runs Llama 3.1 8B Q8 with llama.cpp: DevQuasar/Mi50. A little blog post about the hardware: http://devquasar.com/uncategorized/amd-radeon-instinct-mi50-cheap-inference/