sevzero / inference.py

Commit History

Fix Phase 2 timeout: cap LLM retries, move server wait to main, add global time guard
382d0fd

Mist-ic commited on

Fix Phase 2: add server readiness check and wrap all httpx calls in try/except
b971f92

Mist-ic commited on

Fix inference.py to match sample script exactly: HF_TOKEN var name, no fallback
33fb36b

Mist-ic commited on

Fix Phase 2: add [START]/[STEP]/[END] structured output + simulation fixes
524b287

Mist-ic commited on

Increase max_tokens to 4000 for thinking model compatibility
ff0696e

Mist-ic commited on

Add baseline results, outputs dir, and backup Groq fallback
67e0b04

Mist-ic commited on

Improve inference script robustness and fallback strategy
641125f

Mist-ic commited on

Fix HTTP endpoint statefulness and response format
99b8b51

Mist-ic commited on

Add baseline inference script and Dockerfile
00225fe

Mist-ic commited on