mradermacher/ReForm-14B-RL-entropy-GGUF Reinforcement Learning • 15B • Updated about 1 month ago • 63