view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) Dec 9, 2022 • 171
AlicanKiraz0/Seneca-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Safe-Q2_K-GGUF Text Generation • Updated 29 days ago • 1.42k • 5
AlicanKiraz0/Seneca-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Safe-Q2_K-GGUF Text Generation • Updated 29 days ago • 1.42k • 5
AlicanKiraz0/Seneca-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Safe-Q8_0-GGUF Text Generation • Updated 29 days ago • 215 • 2
AlicanKiraz0/Seneca-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Safe-Q8_0-GGUF Text Generation • Updated 29 days ago • 215 • 2
AlicanKiraz0/Seneca-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Safe-Q8_0-GGUF Text Generation • Updated 29 days ago • 215 • 2
AlicanKiraz0/SenecaLLM-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Q4_K_M-GGUF Text Generation • Updated 29 days ago • 319 • 4
AlicanKiraz0/SenecaLLM_x_Qwen2.5-7B-CyberSecurity Text Generation • Updated about 1 month ago • 191 • 4
AlicanKiraz0/SenecaLLM-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Q4_K_M-GGUF Text Generation • Updated 29 days ago • 319 • 4
AlicanKiraz0/SenecaLLM-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Q4_K_M-GGUF Text Generation • Updated 29 days ago • 319 • 4