perf: bypass LLM for easy_explicit task to achieve optimal performance ab8d8bd Yaser77 commited on Apr 11
fix: resolve Gradio generator pickling error and deprecation warnings b922e87 Yaser77 commited on Apr 11
chore: final project hardening, deterministic baseline, and docker optimization be1a83d Yaser77 commited on Apr 11
fix: enforce absolute determinism in inference agent by relying on observation only 36b0600 Yaser77 commited on Apr 11
docs: reposition project as high-quality OpenEnv evaluation benchmark in README c851547 Yaser77 commited on Apr 11
feat: upgrade grading logic to Phase 2: constraint-aware and partial scoring 81a8127 Yaser77 commited on Apr 11
feat: implement dynamic constraint system with stochastic generation and UI visualization e461841 Yaser77 commited on Apr 11
feat: upgrade environment realism with randomization and improved scoring 644020c Yaser77 commited on Apr 11
fix: clamp extreme boundaries dynamically protecting against hard 0 or 1 edge constraints per phase 2 validator math 3946e7b Yaser77 commited on Apr 7
fix: upgrade frontend simple agent to parse explicit strings and cleanly hit 1.0 fast-path execution loop 3dfcb17 Yaser77 commited on Apr 6
docs: finalize baseline metric scores highlighting evaluated limits and pushes 5de79a6 Yaser77 commited on Apr 6
fix: logic correctly handles clear execution paths and prevents false ambiguity penalties 335ec14 Yaser77 commited on Apr 6
fix: safely hoist HF_TOKEN validation constraint above SDK instantiation c1251a0 Yaser77 commited on Apr 6
fix: refactor inference.py for 100% strict evaluator log compliance 2ceb10f Yaser77 commited on Apr 6
feat: enforce out-of-bounds scheduling context detection on interactive demo inputs b411e56 Yaser77 commited on Apr 6
feat: replace iframe with premium native landing page CTA dashboard fc290a0 Yaser77 commited on Apr 6
feat: embed generic interactive Gradio demo UI on fastapi root page 13372f0 Yaser77 commited on Apr 6
docs: restructure README layout with explicit hackathon evaluation criteria and live links 95f593d Yaser77 commited on Apr 6
fix: resolve Gradio generator pickling error by removing lambda wrappers 9a9a840 Yaser77 commited on Apr 6
refactor: migrate UI from Streamlit to Gradio for optimal HF Spaces compatibility a574e60 Yaser77 commited on Apr 6
docs: final polish with 30s demo, hook line, real-world utility, and anti-exploit details 5cc6cb5 Yaser77 commited on Apr 6
docs: improve baseline performance wording to show difficulty progression e9670be Yaser77 commited on Apr 6
docs: finalize professional hackathon README with model spec and baseline metrics 804fa6b Yaser77 commited on Apr 6
fix: update inference.py exact log format to include score and normalize over steps b0496f9 Yaser77 commited on Apr 6