snap-stanford/dtwin_Medium_gpt4o_eval_10
• Updated • 200 • 9
snap-stanford/dtwin_Medium_gpt4o_eval_0
• Updated • 200 • 8
snap-stanford/dtwin_Medium
• Updated • 1.01k • 6
snap-stanford/pubmed_system
• Updated • 1.93k • 4
snap-stanford/hotpotqa_system
• Updated • 13k • 16
snap-stanford/pubmed_pipeline-preference_scorer-new
• Updated • 1.09k • 3
snap-stanford/bigcodebench_three_agents_pipeline-preference_scorer
• Updated • 526 • 3
snap-stanford/hotpotqa_four_agents_pipeline-preference_modular_model_prior-bak
• Updated • 3.4k • 5
snap-stanford/preference_iterative_hard
• Updated • 717 • 3