feat: add GridMind-RL GRPO training notebook for industrial energy management 87ce30f adityss committed on Apr 26
feat: implement Unsloth GRPO training pipeline with environment-backed reward functions and balanced dataset generation 27d3504 adityss committed on Apr 26
fix: disable AMP for quantized models to avoid gradient scaler issues in GRPO training 19ba2eb adityss committed on Apr 26
feat: update GRPO training configuration with additional parameters for logging and precision f3ecc94 adityss committed on Apr 26
Adjust logging configuration for training: log every step, enable completion metrics, and limit completions printed per step. a6b45e9 adityss committed on Apr 26
feat: add GridMind GRPO training notebook for multi-theme reinforcement learning 999605c Prajwal782007 committed on Apr 26
feat: add GRPO training notebook for GridMind-RL environment 505323f Prajwal782007 committed on Apr 25
feat: add GridMind GRPO training notebook for Meta PyTorch OpenEnv hackathon 29b9cd0 Prajwal782007 committed on Apr 25
feat: add GridMind GRPO training notebook for industrial energy management environment 9d42d14 Prajwal782007 committed on Apr 25
feat: implement GridMind-RL training pipeline with GRPO Colab notebook and Unsloth configuration script b0701ef Prajwal782007 committed on Apr 25
feat: add GRPO training pipeline for GridMind-RL environment via Unsloth and TRL 26e9b86 Prajwal782007 committed on Apr 25
feat: add submission validator script and GRPO training notebook, and update Python version requirement to >=3.10 7d89faf Prajwal782007 committed on Apr 25
feat: add GridMind GRPO training environment and Unsloth training script 3d49e8a Prajwal782007 committed on Apr 25
fix: order imports in Step 1 and add missing torch import in Step 7 c220c03 Prajwal782007 committed on Apr 25
fix: move max_new_tokens from GRPOConfig to GRPOTrainer generation_kwargs dc14955 Prajwal782007 committed on Apr 25
feat: update HF space URL, add judge demo scripts and project documentation a4bc605 Prajwal782007 committed on Apr 25
fix: update health check endpoint in GridMind notebook and provide utility script to apply fix 18750f8 Prajwal782007 committed on Apr 25
Add coordinator endpoint tests and project readiness verification script 88da572 adityss committed on Apr 25
fix: update training script with seed variation, fix reward normalization, regenerate training curves showing 0.52->0.67 improvement bdc9954 adityss committed on Apr 25
feat: add GridMind GRPO training notebook using Unsloth and HF TRL bdadba1 adityss committed on Apr 25
Add Task 4 instruction following, Curriculum Manager for self-improvement, and world modeling simulation 0af208b adityss committed on Apr 22