This collection contains safetyQA dataset for safe SPIN training and trained models
Yifan Wang
AmberYifan
AI & ML interests
None yet
Organizations
Collections
1
models
94
AmberYifan/Qwen2-7B-spin-2k
Updated
•
2
AmberYifan/Qwen2-7B-dpo-2k
Updated
•
2
AmberYifan/Mistral-7B-v0.3-spin-2k-hhrlhf
Updated
•
6
AmberYifan/Mistral-7B-v0.3-gen-dpo-2k-hhrlhf
Updated
•
6
AmberYifan/Mistral-7B-v0.3-dpo-2k-hhrlhf
Updated
•
7
AmberYifan/Mistral-7B-v0.1-spin-2k-hhrlhf
Updated
•
6
AmberYifan/Mistral-7B-v0.1-gen-dpo-2k-hhrlhf
Updated
•
6
AmberYifan/Mistral-7B-v0.1-dpo-2k-hhrlhf
Updated
•
6
AmberYifan/Qwen2.5-7B-spin-2k
Updated
•
6
AmberYifan/Qwen2.5-7B-gen-dpo-2k
Updated
•
6
datasets
25
AmberYifan/mistral-v0.1-spin-hhrlhf
Viewer
•
Updated
•
5.5k
AmberYifan/sft-spin-filter
Updated
AmberYifan/sft-spin-kcenter-5k
Viewer
•
Updated
•
5.5k
AmberYifan/gsm8k-sft
Viewer
•
Updated
•
8.79k
•
16
AmberYifan/sft-spin-v
Viewer
•
Updated
•
50.5k
•
159
AmberYifan/safeRLHF-SFT
Viewer
•
Updated
•
83.4k
AmberYifan/SPIN-trans-DPOformat
Viewer
•
Updated
•
55k
AmberYifan/spin-v-diverse
Viewer
•
Updated
•
55k
AmberYifan/dpo-v
Viewer
•
Updated
•
55k
•
19
AmberYifan/spin-v
Viewer
•
Updated
•
55k