chore: update inference script and core logic for alignment 6a6a0f9 ashishMenon05 commited on 29 days ago
fix: add _clamp method to base_grader and update all graders to use it for strict 0-1 range 0375ef1 ashishMenon05 commited on 29 days ago
fix: clamp all scores in reward_engine and environment to be strictly between 0 and 1 eeb6990 ashishMenon05 commited on 29 days ago
fix: adjust easy grader criteria so max score is 0.98 (not 1.0) 2eb124c ashishMenon05 commited on 29 days ago
fix: ensure all reward values are strictly between 0 and 1 ed4fc09 ashishMenon05 commited on 29 days ago
fix: ensure grader scores are strictly between 0 and 1 (not 0.0 or 1.0) 4379b47 ashishMenon05 commited on 29 days ago
add: openenv-core dependency required for hackathon evaluator 3e2ef4e ashishMenon05 commited on 29 days ago
feat: remove multi-agent verdict panel, keep unified summary only ae4346b ashishMenon05 commited on 29 days ago
build: update frontend deployment assets with latest UI changes 2142915 ashishMenon05 commited on 29 days ago
feat: add unified investigation summary combining all agent conclusions c90dd9f ashishMenon05 commited on 29 days ago
build: update production distribution assets for multi-agent support f309cb4 ashishMenon05 commited on 29 days ago
feat: unlimited agents support with fixed build configuration cb0d29a ashishMenon05 commited on 29 days ago
fix(frontend): correct syntax errors in SettingsView for Tailwind v4 compatibility 68fba27 ashishMenon05 commited on 29 days ago
chore: update dependencies and environment configuration for unlimited agent support 26f67bb ashishMenon05 commited on 29 days ago
feat(ui): support unlimited agents with scalable terminal grid and scrollable settings 7fbbd40 ashishMenon05 commited on 29 days ago
fix(backend): fix syntax error in reset handler and properly flush UI terminal logs 174293c ashishMenon05 commited on 29 days ago
fix(frontend): dynamically map multi-agent message streams in websocket hook and dashboard render 3a45de3 ashishMenon05 commited on 29 days ago
fix(backend): resolve hardcoded agent_a reference in simulation loop startup 9ffd769 ashishMenon05 commited on 29 days ago
fix(backend): resolve AttributeError in episode reset payload 399ee59 ashishMenon05 commited on 29 days ago
feat: complete multi-agent refactor and robust scoring updates 9fdf940 ashishMenon05 commited on 29 days ago
fix: full resubmission patch - fix [STEP] format, add close(), expose system_state, fix /state endpoint, improve reward variance fd5d7f9 ashishMenon05 commited on 29 days ago
refactor: improve environment variable handling for API authentication - 18:47 3168d77 ashishMenon05 commited on Apr 8
fix(inference): lazily initialize OpenAI client to support test runner mocks 43956b9 Your Name commited on Apr 8
fix(inference): exact match of OpenAI client init for AST validators 354e4bd Your Name commited on Apr 8
fix(inference): perfect alignment with pre-submission checklist defaults a22ca25 Your Name commited on Apr 8
final: absolute comprehensive synchronization - capturing server fixes - 00:51 0243041 Antigravity commited on Apr 7
final: definitive release synchronization including pyproject.toml - 00:43 4f14584 Antigravity commited on Apr 7
final: release synchronization - refined reset endpoint state logic - 00:38 ecbd542 Antigravity commited on Apr 7
final: definitive release synchronization with defensive state fixes - timestamp 00:33 e5624f5 Antigravity commited on Apr 7
final: 15th definitive synchronization - refined UI logic and state helpers - timestamp 2026-04-07 23:38:07 54f4814 Antigravity commited on Apr 7
final: definitive release synchronization with UI button visibility logic - timestamp 2026-04-07 23:31:05 670934a Antigravity commited on Apr 7
fix: add robust safety checks for agent.messages (fixes non-iterable crash) b133b4c Antigravity commited on Apr 7