Max
reciprocate
AI & ML interests
Reward models
Recent Activity
liked
a model
27 days ago
Qwen/QwQ-32B-Preview
Organizations
reciprocate's activity
fix(readme): rename `map` -> `filter` in code for selecting subset
#3 opened 8 months ago
by
reciprocate
change mt bench plot
#1 opened about 1 year ago
by
reciprocate
is it reward model? how can we use it?
1
#1 opened over 1 year ago
by
Asaf-Yehudai