Predict human preference to LLM responses.
Bill Xu
billxbf
AI & ML interests
None yet
Recent Activity
published
a dataset
7 days ago
billxbf/aimo-hard-bilingual
updated
a model
27 days ago
billxbf/nemo-sft-orpo
published
a model
27 days ago
billxbf/nemo-sft-orpo
Organizations
None yet
Collections
1
models
8

billxbf/nemo-sft-orpo
Updated
•
15

billxbf/chai-nemo13b-sft-orpo-merge_v2
Text Generation
•
Updated
•
23

billxbf/chai-nemo-sft-orpo-merge
Text Generation
•
Updated
•
26

billxbf/wsdm-qwen14b_dare_dslerp-gptq-q4
Text Classification
•
Updated
•
18

billxbf/phi4_4k_dare
Text Classification
•
Updated
•
17

billxbf/bulla_7b
Updated
•
5

billxbf/mmos-deepseek-math-7b
Text Generation
•
Updated
•
19

billxbf/specialized-rewoo-planner-7b
Updated
datasets
7
billxbf/aimo-hard-bilingual
Updated
•
4
billxbf/lmsys61k
Viewer
•
Updated
•
110k
•
73
billxbf/ppt127k
Viewer
•
Updated
•
127k
•
68
billxbf/arxiv_dump
Viewer
•
Updated
•
11.1k
•
157
•
1
billxbf/yfdump_5m
Viewer
•
Updated
•
5.18M
•
78
billxbf/rewoo-instruction-finetuning
Viewer
•
Updated
•
2.04k
•
81
•
2
billxbf/sotu2023-qa
Viewer
•
Updated
•
876
•
101