reasoning-degeneration-dev/t1-strategy-arena-frozenlake-backward_chaining-together_ai-moonshotai-kimi-k2-thinking-20637061 Viewer • Updated 13 days ago • 10 • 13
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-backward_chaining-together_ai-moonshotai-kimi-k2-thinking-429343f5 Viewer • Updated 13 days ago • 10 • 14
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-backward_chaining-together_ai-qwen-qwen3-next-80b-a3b-thin-fea1b895 Viewer • Updated 13 days ago • 10 • 14
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-backward_chaining-together_ai-moonshotai-kimi-k2-instruct-b54be491 Viewer • Updated 13 days ago • 10 • 13
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-baseline-together_ai-qwen-qwen3-next-80b-a3b-thin-e3fa99d5 Viewer • Updated 13 days ago • 10 • 12
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-backward_chaining-together_ai-qwen-qwen3-next-80b-a3b-thin-1d8d6196 Viewer • Updated 13 days ago • 10 • 14
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-backward_chaining-together_ai-qwen-qwen3-next-80b-a3b-inst-0898857d Viewer • Updated 13 days ago • 10 • 12
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-baseline-together_ai-qwen-qwen3-next-80b-a3b-inst-97d2a5df Viewer • Updated 13 days ago • 10 • 11
reasoning-degeneration-dev/wingdings-swebench-verified-1.0-mini-swe-agent-Qwen3-Next-80B-A3B-Instruct-20260222 Viewer • Updated 13 days ago • 10 • 34
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-baseline-together_ai-moonshotai-kimi-k2-thinking-9910c637 Viewer • Updated 13 days ago • 10 • 20
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-baseline-together_ai-moonshotai-kimi-k2-instruct-78cc6a2e Viewer • Updated 13 days ago • 10 • 21
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-backward_chaining-together_ai-moonshotai-kimi-k2-thinking-bef08885 Viewer • Updated 13 days ago • 10 • 16
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-backward_chaining-together_ai-moonshotai-kimi-k2-instruct-e271b21c Viewer • Updated 13 days ago • 10 • 17
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-baseline-together_ai-qwen-qwen3-next-80b-a3b-thin-9106273f Viewer • Updated 13 days ago • 10 • 21
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-baseline-together_ai-qwen-qwen3-next-80b-a3b-inst-6ad757f1 Viewer • Updated 13 days ago • 10 • 21
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-backward_chaining-together_ai-qwen-qwen3-next-80b-a3b-thin-9f99c03d Viewer • Updated 13 days ago • 10 • 24
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-baseline-together_ai-moonshotai-kimi-k2-instruct-addc05ee Viewer • Updated 13 days ago • 10 • 14
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-backward_chaining-together_ai-qwen-qwen3-next-80b-a3b-inst-04958e25 Viewer • Updated 13 days ago • 10 • 24
reasoning-degeneration-dev/t1-wingdings-arena-frozenlake-together_ai-moonshotai-kimi-k2-thinking-4636da73 Viewer • Updated 13 days ago • 10 • 13
reasoning-degeneration-dev/t1-wingdings-countdown-kimi-k2-instruct Viewer • Updated 13 days ago • 10 • 12
reasoning-degeneration-dev/t1-wingdings-countdown-together_ai-moonshotai-kimi-k2-thinking-61281244 Viewer • Updated 13 days ago • 10 • 9
reasoning-degeneration-dev/t1-wingdings-musr-murder-together_ai-moonshotai-kimi-k2-thinking-8c1b04d5 Viewer • Updated 13 days ago • 10 • 9
reasoning-degeneration-dev/t1-wingdings-arena-frozenlake-kimi-k2-instruct Viewer • Updated 13 days ago • 10 • 10
reasoning-degeneration-dev/t1-wingdings-musr-murder-kimi-k2-instruct Viewer • Updated 13 days ago • 10 • 12
reasoning-degeneration-dev/t1-strategy-musr-critfirst-qwen3-80b-thinking-musr-cf Viewer • Updated 13 days ago • 10 • 16
reasoning-degeneration-dev/t1-strategy-musr-counterfactual-qwen3-80b-thinking-musr-cfact Viewer • Updated 13 days ago • 10 • 14
reasoning-degeneration-dev/t1-strategy-musr-anti-qwen3-80b-thinking-musr-cf Viewer • Updated 13 days ago • 10 • 16
reasoning-degeneration-dev/t1-strategy-musr-baseline-qwen3-80b-thinking-musr-base Viewer • Updated 13 days ago • 10 • 17
reasoning-degeneration-dev/t1-strategy-musr-anti-counterfactual-qwen3-80b-thinking-musr-cfact Viewer • Updated 13 days ago • 10 • 12
reasoning-degeneration-dev/t1-strategy-musr-counterfactual-kimi-k2-thinking-musr-cfact Viewer • Updated 13 days ago • 10 • 12