ml-debug-env / inference.py

Commit History

Block A B C: partial observability, LLM judge, adversarial scheduler
49aa3ca

rak2315 committed on

v3: compound tasks, hardened graders, other type, 8 tasks total
6d9a8b2

rak2315 committed on

add 6 tasks, fix log format, multi-turn retry, grader improvements
4108ae8

rak2315 committed on

fix: inference.py calls LLM proxy directly
e749fdf

rak2315 committed on

fix: 20/20 all tasks 1.0
63eddc8

rak2315 committed on

fix: use API_BASE_URL and API_KEY env vars for LLM proxy
645efc4

rak2315 committed on

fix: self-contained inference.py, no network dependency
2a87ebe

rak2315 committed on

fix: emit [START]/[STEP]/[END] structured output for Phase 2 validator
d92195b

rak2315 committed on

Fix inference.py to hit deployed HF Space baseline endpoint
ff42cc0

rak2315 committed on

Add inference.py for hackathon checker
8abdf62

rak2315 committed on