EntropyEnv / inference.py

Commit History

chore: Apply Bug #2 and Bug #3 strict min/max bound clamping to prevent out of range scores and fix windows encoding
ee547a6
Running

immortalindeed commited on

Final strict spec-compliance polish: score precision, empty rewards, updated test assertions
6284048

immortalindeed commited on

Fix syntax of [END] STDOUT line to perfectly match Hackathon mandatory format with score= parameter
f96532b

immortalindeed commited on

Fix: abort [END] lines use rewards=0.01 instead of empty rewards= to prevent evaluator 0.0 score
723407b

immortalindeed commited on

Spec-compliance overhaul: remove difficulty_multiplier, weighted blend scoring, dep_hard fix, [END] format
f3fd4ef

immortalindeed commited on

Skip benchmark store on fatal API errors (402/401/403)
1ecd7e1

immortalindeed commited on

Fix dep_hard Counter bug, add fatal error handling, update README with 14-model benchmark
3466d21

immortalindeed commited on

Fix state machine bugs and switch to average scoring for discriminative benchmarking
cd5104a

immortalindeed commited on

Fix score aggregation: use max(rewards) for discriminative multi-turn scoring
fe9aa5c

immortalindeed commited on

Remove rate limiter (blocks evaluator) and fix score aggregation to clamped sum
3dfb5fe

immortalindeed commited on

fix(benchmark): Hardening multi-agent environment and strict score compliance
6f95f2a

immortalindeed commited on

Clamp scores strictly to (0.01, 0.99) to pass OpenEnv Phase 2 continuous environment score verification checks
829f543

immortalindeed commited on

Revert incorrect log parsing changes and fix reward summation logic
d270d2a

immortalindeed commited on

Fix log formatting to exactly match diagnostic feedback
b4f20cf

immortalindeed commited on

Fix benchmark output saving: add results dir and print errors
9bb611a

immortalindeed commited on

Fix HF_TOKEN parsing for strict validation
46acf43

immortalindeed commited on