arxiv:2405.07863
Wei Xiong
weqweasdas
AI & ML interests
Machine learning, RLHF
Recent Activity
updated a dataset about 5 hours ago: selfcorrexp/distill_40koldc2r_120kw_84kcorr
published a dataset about 5 hours ago: selfcorrexp/distill_40koldc2r_120kw_84kcorr
updated a dataset about 5 hours ago: selfcorrexp/distill_0kc2r_120kw_84kcorr
models
23
weqweasdas/zephyr-7b-dpo-full • Text Generation • 7
weqweasdas/zephyr-7b-gemma-dpo
weqweasdas/zephyr-7b-sft-full
weqweasdas/zephyr-7b-dpo-qlora
weqweasdas/gpt2-cpt-dutch • Text Generation • 65
weqweasdas/zephyr-7b-gemma-sft
weqweasdas/raft_baseline_zephyr_packing_model6_1_4_e6_weight085 • Text Generation • 4
weqweasdas/raft_baseline_zephyr_packing_model6_1_4_e6 • Text Generation • 5
weqweasdas/raft_baseline_zephyr_packing_model6 • Text Generation • 4
weqweasdas/raft_baseline_openchat_llama13b_model1 • Text Generation • 6
datasets
172
weqweasdas/ace_processed • 5.18M
weqweasdas/llama31_70b_chosen_type12_mix • 21.5k • 15
weqweasdas/prompt_math_test • 15k • 19
weqweasdas/fixed05_llasft_math_7ktype2_7ktype3_ver2_150_tmp10_generation_with_rewards • 30k • 30
weqweasdas/filtered_numia_prompt15k • 15k • 18
weqweasdas/filtered_numia_prompt30k • 30.6k • 17
weqweasdas/prompt_numinamath • 119k • 19
weqweasdas/prompt_numinamath_with_gts • 168k • 16
weqweasdas/fixed05_llasft_math_3ktype2_7ktype3_ver2_250_tmp10_generation_with_rewards • 50k • 18
weqweasdas/fixed05_llasft_math_3ktype2_7ktype3_ver2_250_more_datatmp10_vllmexp_retest2_generation • 50k • 16