Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
ankner
's Collections
Base Models With Chat Templates
Hydra Decoding
Oracle 2 Proxy Models
Oracle 2 Proxy Data
Multi Judgement Oversight
Critique-out-Loud Reward Models
Oracle 2 Proxy Data
updated
Jan 21
Upvote
-
ankner/gsm8k-CoT
Viewer
•
Updated
Jan 17
•
8.78k
•
370
•
1
ankner/gsm8k-sft
Viewer
•
Updated
Jan 19
•
1.1k
•
127
•
1
ankner/gsm8k-rl
Viewer
•
Updated
Jan 19
•
7.68k
•
561
ankner/apps-sft
Viewer
•
Updated
Jan 12
•
3.51k
•
81
ankner/apps-rl
Viewer
•
Updated
Jan 21
•
5.25k
•
302
ankner/apps-rl-deepseek-7b-inst-labeled
Viewer
•
Updated
Jan 13
•
5.25k
•
51
ankner/chat-pref
Viewer
•
Updated
Jan 17
•
39.7k
•
57
Upvote
-
Share collection
View history
Collection guide
Browse collections