NousResearch/DeepHermes-ToolCalling-Specialist-Atropos Reinforcement Learning • 8B • Updated Apr 28 • 919 • 10
ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q8 Reinforcement Learning • 8B • Updated Mar 28 • 4.29k • 183
NousResearch/DeepHermes-Egregore-v2-RLAIF-8b-Atropos-GGUF Reinforcement Learning • 8B • Updated May 5 • 23 • 2
MarioBarbeque/dqn-SpaceInvadersNoFrameskip-v4 Reinforcement Learning • Updated about 3 hours ago • 17 • 1