refactor: replace heuristic log generation with Go-based environment simulation and update API schema 3b977fc adityss commited on 23 days ago
feat: implement Go-based GridMind-RL simulation core and update inference interface (graph) a4671c4 Prajwal782007 commited on 24 days ago
feat: define GridMind-RL environment data models and task structures c009bc5 adityss commited on 24 days ago
feat: implement multi-component dense reward function and environmental logic for GridMind-RL b81683f adityss commited on 24 days ago
Add Task 4 instruction following, Curriculum Manager for self-improvement, and world modeling simulation 0af208b adityss commited on 26 days ago
fix: clamp scores after rounding and ensure all sub-scores are clamped e58b5ec ShreeshantXD commited on Apr 7
fix: clamp all scores to open interval (0, 1) to meet validator requirements ef0556b ShreeshantXD commited on Apr 7
feat: implement core environment simulation logic and update baseline scores 5569b4d adityss commited on Apr 4
docs: add reward structure and deployment guide, update baseline scores, and implement dashboard UI e3fbc9c Prajwal782007 commited on Apr 3
feat: add baseline scores JSON, inference script, and update Dockerfile for improved project structure 6d74982 ShreeshantXD commited on Apr 2
Enhance dashboard: Live Simulation, 72h episodes, and step reward tracking curve 4c1963b adityss commited on Apr 1