Commit History

Add WebSocket reconnection and increased timeouts for HF Space
8eea9e0

Yatin Taneja committed on

Fix outer exception [END] log format to match spec
e05e3bc

Yatin Taneja committed on

Pass task_level to reset() for correct difficulty when using HF Space
f66e7fa

Yatin Taneja committed on

Fix: fallback to HF Space when LOCAL_IMAGE_NAME not set (judges don't set it)
15adff9

Yatin Taneja committed on

Reorder endpoints to match OpenEnv spec, add Gemma-4-26B to scores
7fbef9e

Yatin Taneja committed on

Restore HF README with YAML header
7beb3d5

Yatin Taneja committed on

Clean README for GitHub (no HF YAML header)
1beb236

Yatin Taneja committed on

Add Gemma-4-26B baseline scores and trajectory proof
81b3988

Yatin Taneja committed on

Add OpenAI sync import for checklist compliance
1a48b28

Yatin Taneja committed on

Fix: restore LOCAL_IMAGE_NAME default, wrap main in try/except
0e8fe0f

Yatin Taneja committed on

Fix: match submission checklist — defaults only for API_BASE_URL and MODEL_NAME
618a908

Yatin Taneja committed on

Fix: open README link in new tab (iframe blocked)
82e8f3a

Yatin Taneja committed on

Clean up baseline table formatting, inline hackathon note
cb1131f

Yatin Taneja committed on

Update landing page: scientific description + README link
9e44826

Yatin Taneja committed on

Elevate README: scientific abstract, real-world applications, hackathon acknowledgment
3c6347c

Yatin Taneja committed on

Fix: deterministic language in baseline scores
77c0e6a

Yatin Taneja committed on

Update baseline scores: Gemma-4-31B, Qwen 3.6+, MiniMax M2.7
7d7329a

Yatin Taneja committed on

Add baseline trajectory logs as proof (Gemma-4-31B, Qwen 3.6+, MiniMax M2.7)
a358efd

Yatin Taneja committed on

Tighten DIC grader: require critical flagging + reference range queries
e771999

Yatin Taneja committed on

Fix: accept task_level in reset() for proper level selection via API/WS
5f7ffe3

Yatin Taneja committed on