Function Calling PO Dataset Function Calling Preference Optimization Datasets orion-research/Aura-Tooling-DPO-v3 Viewer • Updated Dec 7, 2024 • 3.8k • 24 GreenNode/RLHF_glaive_toolcall_en Viewer • Updated Jan 7 • 1.23k • 17 AymanTarig/Qwen2.5-0.5B-FC-v1-mistakes-critiques Viewer • Updated Nov 20, 2024 • 9.71k • 14 roborovski/glaive-tool-usage-dpo Viewer • Updated Feb 29, 2024 • 42k • 29 • 2
RLHF + Code Vezora/Code-Preference-Pairs Viewer • Updated Jul 28, 2024 • 54k • 204 • 25 quangduc1112001/python-code-DPO-fine-tune Viewer • Updated Nov 4, 2024 • 2k • 54 • 2 xinlai/Math-Step-DPO-10K Viewer • Updated Jul 4, 2024 • 10.8k • 414 • 56 minfeng-ai/leetcode_preference Viewer • Updated Sep 6, 2023 • 457 • 105 • 7
Function Calling PO Dataset Function Calling Preference Optimization Datasets orion-research/Aura-Tooling-DPO-v3 Viewer • Updated Dec 7, 2024 • 3.8k • 24 GreenNode/RLHF_glaive_toolcall_en Viewer • Updated Jan 7 • 1.23k • 17 AymanTarig/Qwen2.5-0.5B-FC-v1-mistakes-critiques Viewer • Updated Nov 20, 2024 • 9.71k • 14 roborovski/glaive-tool-usage-dpo Viewer • Updated Feb 29, 2024 • 42k • 29 • 2
RLHF + Code Vezora/Code-Preference-Pairs Viewer • Updated Jul 28, 2024 • 54k • 204 • 25 quangduc1112001/python-code-DPO-fine-tune Viewer • Updated Nov 4, 2024 • 2k • 54 • 2 xinlai/Math-Step-DPO-10K Viewer • Updated Jul 4, 2024 • 10.8k • 414 • 56 minfeng-ai/leetcode_preference Viewer • Updated Sep 6, 2023 • 457 • 105 • 7