See our paper at https://huggingface.co/papers/2405.19332.
Shenao Zhang
ZhangShenao
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 2 hours ago
ZhangShenao/bigmath_chat
published
a dataset
about 2 hours ago
ZhangShenao/bigmath_chat
liked
a dataset
about 13 hours ago
SynthLabsAI/Big-Math-RL-Verified
Organizations
Collections
3
-
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-3
Text Generation • Updated • 122 • 5 -
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-2
Text Generation • Updated • 12 -
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-1
Text Generation • Updated • 13 -
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment
Paper • 2405.19332 • Published • 22
models
371
ZhangShenao/bt_math_gsm-Mistral-7B-Instruct-v0.2_7500_temp_1.0_gen_1_mlr5e-5_reverse
Updated
•
2
ZhangShenao/bt_math_gsm-gemma-1.1-7b-it_7500_temp_1.0_gen_1_mlr5e-5_reverse
Updated
•
13
ZhangShenao/bt_math_gsm-Meta-Llama-3-8B-Instruct_7500_temp_1.0_gen_1_mlr5e-5
Updated
•
2
ZhangShenao/bt_math_gsm-Mistral-7B-Instruct-v0.2_7500_temp_1.0_gen_1_mlr5e-5
Updated
•
3
ZhangShenao/bt_sft_math_gsm-gemma-1.1-7b-it-sft-sample_7500_tp_mlr_5e-5
Updated
•
2
ZhangShenao/bt_math_gsm-gemma-1.1-7b-it_7500_temp_1.0_gen_1_mlr5e-5
Updated
•
12
ZhangShenao/bt_math_gsm-Mistral-7B-Instruct-v0.2-sft-sample_7500_tp_mlr_5e-5
Updated
•
5
ZhangShenao/math_gsm-gemma-2-9b-it-rs_nnew-sample_7500_temp_1.0_gen_30_mlr5e-5_ex
Updated
•
5
ZhangShenao/math_gsm-Mistral-7B-Instruct-v0.2-rs_nnew-sample_7500_temp_1.0_gen_30_mlr5e-5_ex
Updated
•
6
ZhangShenao/math_gsm-Meta-Llama-3-8B-Instruct-rs_nnew-sample_7500_temp_1.0_gen_30_mlr5e-5_ex
Updated
•
6
datasets
230
ZhangShenao/bigmath_chat
Viewer
•
Updated
•
251k
ZhangShenao/bt-math_gsm-Meta-Llama-3-8B-Instruct-iter_sample_7500_temp_1.0_gen_1_mlr5e-5
Viewer
•
Updated
•
5.43k
•
2
ZhangShenao/bt-math_gsm-Mistral-7B-Instruct-v0.2-iter_sample_7500_temp_1.0_gen_1_mlr5e-5
Viewer
•
Updated
•
3.05k
•
5
ZhangShenao/bt_sft-math_gsm-gemma-1.1-7b-it-iter_sample_7500_tp
Viewer
•
Updated
•
7.47k
•
2
ZhangShenao/bt-math_gsm-gemma-1.1-7b-it-iter_sample_7500_temp_1.0_gen_1_mlr5e-5
Viewer
•
Updated
•
3.74k
•
17
ZhangShenao/bt_sft-math_gsm-Mistral-7B-Instruct-v0.2-iter_sample_7500_tp
Viewer
•
Updated
•
7.47k
•
32
ZhangShenao/rs_nnew-math_gsm-gemma-2-9b-it-iter_sample_7500_temp_1.0_gen_30_mlr5e-5_ex
Viewer
•
Updated
•
3.57k
•
30
ZhangShenao/rs_nnew-math_gsm-Mistral-7B-Instruct-v0.2-iter_sample_7500_temp_1.0_gen_30_mlr5e-5_ex
Viewer
•
Updated
•
113
•
35
ZhangShenao/rs_nnew-math_gsm-gemma-1.1-7b-it-iter_sample_7500_temp_1.0_gen_30_mlr5e-5_ex
Viewer
•
Updated
•
758
•
35
ZhangShenao/rs_nnew-math_gsm-Meta-Llama-3-8B-Instruct-iter_sample_7500_temp_1.0_gen_30_mlr5e-5_ex
Viewer
•
Updated
•
5.13k
•
32