PolarisEvals/leaderboard-data
• Updated • 1.14M • 116
PolarisEvals/llm_dataset_completness_2stage_score_mini
• Updated • 10 • 3
PolarisEvals/llm_dataset_completness_2stage_score
• Updated • 54.3k • 4
PolarisEvals/llm_dataset_completness_2stage_justification_score
• Updated • 54.3k • 14
PolarisEvals/llm_dataset_completness_2stage
• Updated • 54.3k • 3
PolarisEvals/shikib_dataset_completeness_2stage_unittest
• Updated • 5.47k • 28
PolarisEvals/shikib_dataset_completeness_2stage_unittest_debug
• Updated • 100 • 4
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_completeness_2stage_unittest_response
• Updated • 5.47k • 4
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_completeness_2stage_unittest
• Updated • 912 • 3
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_filtering_debug
• Updated • 100 • 2
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts
• Updated • 912 • 6
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_questions_filtering_debug
• Updated • 100 • 5
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_questions
• Updated • 982 • 3
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_gpt-4-0613_outputs_json_True_debug
• Updated • 100 • 14
PolarisEvals/training_criteria_dpo_distill_relevance_gpt-4-0613_outputs_json_True_debug
• Updated • 100 • 5
PolarisEvals/training_criteria_dpo_distill
• Updated • 912 • 10
PolarisEvals/synqa_hudson_300_samples_relevance_gpt-4-0613_outputs_json_True_debug
• Updated • 100 • 3
PolarisEvals/synqa_hudson_300_samples_completeness_gpt-4-0613_outputs_json_True_debug
• Updated • 100 • 4
PolarisEvals/synqa_hudson_300_samples
• Updated • 1.5k • 5
PolarisEvals/synqa_hudson_300_samples_clarity_gpt-4-0613_outputs_json_True_debug
• Updated • 100 • 7
PolarisEvals/synqa_hudson_300_queries_rubrics_score_completeness_gpt-4-0613_outputs_json_True
• Updated • 10 • 9
PolarisEvals/synqa_hudson_300_queries_rubrics_score_completeness_gpt-4-0613_outputs_json_False
• Updated • 10 • 6
PolarisEvals/synqa_hudson_300_queries_rubrics_score
• Updated • 7.5k • 10
PolarisEvals/synqa_hudson_300_samples_gpt-4-0613_outputs
• Updated • 81 • 2