GitBag/rebel_armo_OneBatch_lr_5e-7_eta_1e4_bs_128_chosen_sample-reject_won_1723695467 Text Generation • 8B • Updated Aug 15, 2024 • 7
GitBag/rebel_armo_OneBatch_lr_5e-7_eta_1e5_bs_128_chosen_sample-reject_won_1723676144 Text Generation • 8B • Updated Aug 15, 2024 • 7
GitBag/rebel_armo_OneBatch_lr_1e-6_eta_1e5_bs_128_chosen_bon-reject_sample_1723723799 Text Generation • 8B • Updated Aug 15, 2024 • 7
GitBag/rebel_armo_OneBatch_lr_5e-7_eta_1e3_bs_128_chosen_bon-reject_sample_1723704986 Text Generation • 8B • Updated Aug 15, 2024 • 7
GitBag/rebel_armo_OneBatch_lr_1e-6_eta_1e5_bs_128_chosen_sample-reject_won_1723733496 Text Generation • 8B • Updated Aug 15, 2024 • 7
GitBag/rebel_armo_OneBatch_lr_5e-7_eta_1e5_bs_128_chosen_bon-reject_sample_1723666850 Text Generation • 8B • Updated Aug 15, 2024 • 7
GitBag/rebel_armo_OneBatch_lr_5e-7_eta_1e3_bs_128_chosen_sample-reject_won_1723714453 Text Generation • 8B • Updated Aug 15, 2024 • 6
GitBag/rebel_armo_OneBatch_lr_5e-7_eta_1e4_bs_128_chosen_bon-reject_sample_1723685865 Text Generation • 8B • Updated Aug 15, 2024 • 7
GitBag/rebel_armo_OneBatch_lr_5e-7_eta_1e4_bs_128_eps_40000_40k_agg_1723718604 Text Generation • 8B • Updated Aug 15, 2024 • 16
GitBag/rebel_armo_OneBatch_lr_5e-7_eta_1e6_bs_128_eps_40000_40k_agg_1723705488 Text Generation • 8B • Updated Aug 15, 2024 • 7
GitBag/rebel_armo_OneBatch_lr_5e-7_eta_1e5_bs_128_eps_40000_40k_agg_1723712089 Text Generation • 8B • Updated Aug 15, 2024 • 16
GitBag/rebel_armo_OneBatch_lr_3e-7_eta_1e4_bs_128_eps_40000_40k_agg_1723698931 Text Generation • 8B • Updated Aug 15, 2024 • 7
GitBag/rebel_armo_OneBatch_lr_3e-7_eta_1e6_bs_128_eps_40000_40k_agg_1723685865 Text Generation • 8B • Updated Aug 15, 2024 • 7
GitBag/rebel_armo_OneBatch_lr_3e-7_eta_1e5_bs_128_eps_40000_40k_agg_1723692345 Text Generation • 8B • Updated Aug 15, 2024 • 17
GitBag/rebel_armo_OneBatch_lr_3e-7_eta_1e6_bs_128_chosen_bon-reject_sample_1723646241 Text Generation • 8B • Updated Aug 14, 2024 • 19
GitBag/rebel_armo_OneBatch_lr_3e-7_eta_1e6_bs_128_chosen_sample-reject_won_1723655721 Text Generation • 8B • Updated Aug 14, 2024 • 21
GitBag/rebel_armo_OneBatch_lr_3e-7_eta_1e7_bs_128_chosen_bon-reject_sample_1723627360 Text Generation • 8B • Updated Aug 14, 2024 • 7
GitBag/rebel_armo_OneBatch_lr_3e-7_eta_1e7_bs_128_chosen_sample-reject_won_1723636754 Text Generation • 8B • Updated Aug 14, 2024 • 7
GitBag/rebel_armo_OneBatch_lr_3e-7_eta_1e8_bs_128_chosen_sample-reject_won_1723600892 Text Generation • 8B • Updated Aug 14, 2024 • 15
GitBag/rebel_armo_OneBatch_lr_3e-7_eta_1e8_bs_128_chosen_bon-reject_sample_1723574410 Text Generation • 8B • Updated Aug 14, 2024 • 7
GitBag/rebel_ultrafeedback_armo_OneBatch_newprob_full_lr_3e-7_eta_1e5_bs_128_eps_iter_1_1723514196 Text Generation • 8B • Updated Aug 13, 2024 • 7
GitBag/rebel_ultrafeedback_armo_OneBatch_newprob_full_lr_3e-7_eta_1e3_bs_128_eps_iter_1_1723533237 Text Generation • 8B • Updated Aug 13, 2024 • 19
GitBag/rebel_ultrafeedback_armo_OneBatch_newprob_full_lr_3e-7_eta_1e4_bs_128_iter_2_1723535554 Text Generation • 8B • Updated Aug 13, 2024 • 7
GitBag/rebel_ultrafeedback_armo_OneBatch_newprob_full_lr_3e-7_eta_1e4_bs_128_eps_iter_1_1723523769 Text Generation • 8B • Updated Aug 13, 2024 • 7
GitBag/rebel_ultrafeedback_armo_OneBatch_newprob_full_lr_3e-7_eta_1e2_bs_128_iter_2_1723555887 Text Generation • 8B • Updated Aug 13, 2024 • 7
GitBag/rebel_ultrafeedback_armo_OneBatch_newprob_full_lr_3e-7_eta_1e6_bs_128_eps_iter_1_1723498352 Text Generation • 8B • Updated Aug 13, 2024 • 7
GitBag/rebel_ultrafeedback_armo_OneBatch_newprob_full_lr_3e-7_eta_1e3_bs_128_iter_2_1723545015 Text Generation • 8B • Updated Aug 13, 2024 • 7
GitBag/rebel_ultrafeedback_armo_OneBatch_newprob_full_lr_3e-7_eta_1e6_bs_128_iter_2_1723498936 Text Generation • 8B • Updated Aug 13, 2024 • 18
GitBag/rebel_ultrafeedback_armo_OneBatch_newprob_full_lr_3e-7_eta_1e5_bs_128_iter_2_1723526004 Text Generation • 8B • Updated Aug 13, 2024 • 15
GitBag/rebel_ultrafeedback_armo_OneBatch_newprob_full_lr_3e-7_eta_1e6_bs_128_iter_1_1723057670 Updated Aug 8, 2024