snap-stanford/dtwin_Medium_gpt4o_eval_10-2
• Updated • 200 • 4
snap-stanford/dtwin_Medium_gpt4o_eval_6-1
• Updated • 200 • 4
snap-stanford/dtwin_Medium_gpt4o_eval_6-2
• Updated • 200 • 2
snap-stanford/dtwin_Medium_gpt4o_eval_10
• Updated • 200 • 2
snap-stanford/dtwin_Medium_gpt4o_eval_0
• Updated • 200 • 2
snap-stanford/dtwin_Medium
• Updated • 1.01k • 8
snap-stanford/pubmed_system
• Updated • 1.93k • 8
snap-stanford/hotpotqa_system
• Updated • 13k • 7
snap-stanford/pubmed_pipeline-preference_scorer-new
• Updated • 1.09k • 27
snap-stanford/bigcodebench_three_agents_pipeline-preference_scorer
• Updated • 526 • 32
snap-stanford/hotpotqa_four_agents_pipeline-preference_modular_model_prior-bak
• Updated • 3.4k • 28
snap-stanford/preference_iterative_hard
• Updated • 717 • 5