PolarisEvals/llm_dataset_completness_2stage_justification_score Viewer • Updated Jun 13, 2024 • 54.3k • 66
PolarisEvals/shikib_dataset_completeness_2stage_unittest_debug Viewer • Updated Jun 12, 2024 • 100 • 70
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_completeness_2stage_unittest_response Viewer • Updated Jun 12, 2024 • 5.47k • 48
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_completeness_2stage_unittest Viewer • Updated Jun 11, 2024 • 912 • 51
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_filtering_debug Viewer • Updated Jun 11, 2024 • 100 • 47
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts Viewer • Updated Jun 11, 2024 • 912 • 48
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_questions_filtering_debug Viewer • Updated Jun 5, 2024 • 100 • 61
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_questions Viewer • Updated Jun 5, 2024 • 982 • 59
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_gpt-4-0613_outputs_json_True_debug Viewer • Updated Jun 4, 2024 • 100 • 56
PolarisEvals/training_criteria_dpo_distill_relevance_gpt-4-0613_outputs_json_True_debug Viewer • Updated Jun 3, 2024 • 100 • 78
PolarisEvals/synqa_hudson_300_samples_relevance_gpt-4-0613_outputs_json_True_debug Viewer • Updated May 30, 2024 • 100 • 59
PolarisEvals/synqa_hudson_300_samples_completeness_gpt-4-0613_outputs_json_True_debug Viewer • Updated May 30, 2024 • 100 • 46
PolarisEvals/synqa_hudson_300_samples_clarity_gpt-4-0613_outputs_json_True_debug Viewer • Updated May 30, 2024 • 100 • 49