MasterControlAIML/DeepSeek-R1-Qwen2.5-3b-LLM-Judge-Reward-JSON-Unstructured-To-Structured-Lora Text Generation • 8B • Updated Jun 18 • 13
MasterControlAIML/DeepSeek-R1-Qwen2.5-3b-LLM-Judge-Reward-JSON-Unstructured-To-Structured-Merged-Lora-16bit Text Generation • 8B • Updated Jun 18 • 11
MasterControlAIML/DeepSeek-R1-Qwen2.5-3b-LLM-Judge-Reward-JSON-Unstructured-To-Structured-Lora-gguf 8B • Updated Jun 18 • 47
Think Inside the JSON: Reinforcement Strategy for Strict LLM Schema Adherence Paper • 2502.14905 • Published Feb 18 • 9
Think Inside the JSON: Reinforcement Strategy for Strict LLM Schema Adherence Paper • 2502.14905 • Published Feb 18 • 9
Think Inside the JSON: Reinforcement Strategy for Strict LLM Schema Adherence Paper • 2502.14905 • Published Feb 18 • 9
MasterControlAIML/DeepSeek-R1-Qwen2.5-1.5b-SFT-R1-JSON-Unstructured-To-Structured Text Generation • 2B • Updated Feb 25 • 297 • 11
MasterControlAIML/DeepSeek-R1-Qwen2.5-1.5b-SFT-R1-JSON-Unstructured-To-Structured-GGUF 2B • Updated Feb 7 • 65 • 1
MasterControlAIML/DeepSeek-R1-Strategy-Qwen-2.5-1.5b-Unstructured-To-Structured Text Generation • 2B • Updated Feb 3 • 30 • 4
MasterControlAIML/DeepSeek-R1-Qwen-2.5-1.5b-Latest-Unstructured-To-Structured Text Generation • Updated Feb 3 • 1.12k • 5
MasterControlAIML/Qwen2.5-7b-Answer-Distractor-MCQ-Generation-merged-16bit Text Generation • 8B • Updated Jan 31 • 1