docs: rewrite and expand blog post describing GridMind-RL environment architecture and results ea1e214 Prajwal782007 committed 22 days ago
Please provide the file changes or a description of the code modifications so I can generate the commit message for you. 3857109 Prajwal782007 committed 22 days ago
Final cleanup: condense blog post and align README with hackathon judging criteria 79d1279 Prajwal782007 committed 22 days ago
docs: update HF_BLOG_POST.md with OpenEnv interface details, reward engineering insights, and refined training results fb9da93 Prajwal782007 committed 22 days ago
Merge branch 'main' of https://github.com/LO-Kyu/gridmind 70d6600 Prajwal782007 committed 22 days ago
fix: remove broken OpenEnv Specification link from documentation footer a8aa990 Prajwal782007 committed 22 days ago
docs: add training baseline, reward, and loss curve plots for multiple experimental runs 28abef0 adityss committed 22 days ago
feat: add GridMind-RL GRPO training notebook for industrial energy management 87ce30f adityss committed 22 days ago
Merge branch 'main' of https://github.com/LO-Kyu/gridmind 8b17b2a Prajwal782007 committed 22 days ago
feat: implement Unsloth GRPO training pipeline with environment-backed reward functions and balanced dataset generation 27d3504 adityss committed 23 days ago
fix: disable AMP for quantized models to avoid gradient scaler issues in GRPO training 19ba2eb adityss committed 23 days ago
feat: update GRPO training configuration with additional parameters for logging and precision f3ecc94 adityss committed 23 days ago
Adjust logging configuration for training: log every step, enable completion metrics, and limit completions printed per step. a6b45e9 adityss committed 23 days ago
feat: add GridMind GRPO training notebook for multi-theme reinforcement learning 999605c Prajwal782007 committed 23 days ago
feat: add GRPO training notebook for GridMind-RL environment 505323f Prajwal782007 committed 23 days ago
feat: add GridMind GRPO training notebook for Meta PyTorch OpenEnv hackathon 29b9cd0 Prajwal782007 committed 23 days ago
feat: add GridMind GRPO training notebook for industrial energy management environment 9d42d14 Prajwal782007 committed 23 days ago
feat: implement GridMind-RL training pipeline with GRPO Colab notebook and Unsloth configuration script b0701ef Prajwal782007 committed 23 days ago
feat: implement Unsloth GRPO training script with environment-based reward tracking and balanced dataset generation 32d5b8f Prajwal782007 committed 23 days ago
feat: add GRPO training pipeline for GridMind-RL environment via Unsloth and TRL 26e9b86 Prajwal782007 committed 23 days ago
feat: add submission validator script and GRPO training notebook, and update Python version requirement to >=3.10 7d89faf Prajwal782007 committed 23 days ago
feat: add GridMind GRPO training environment and Unsloth training script 3d49e8a Prajwal782007 committed 23 days ago
feat: add script to migrate max_new_tokens from GRPOConfig to GRPOTrainer in notebook 08731ee Prajwal782007 committed 23 days ago
fix: change tokenizer to processing_class in GRPOTrainer acabf6c Prajwal782007 committed 23 days ago
fix: order imports in Step 1 and add missing torch import in Step 7 c220c03 Prajwal782007 committed 23 days ago
fix: move max_new_tokens from GRPOConfig to GRPOTrainer generation_kwargs dc14955 Prajwal782007 committed 23 days ago
fix: enforce GPU usage and assertions in colab notebook 4738130 Prajwal782007 committed 23 days ago
feat: add script to patch gridmind_grpo_colab.ipynb with torch import in cell 5e5826e4 f2d83b9 Prajwal782007 committed 23 days ago
feat: add script to automate TRL dependency updates in the GRPO Colab notebook e9f731a Prajwal782007 committed 23 days ago
fix: update dependencies in colab notebook for GRPOTrainer 7597057 Prajwal782007 committed 23 days ago
chore: add script to update trl dependency version in gridmind_grpo_colab.ipynb af8efb9 Prajwal782007 committed 23 days ago
feat: update HF space URL, add judge demo scripts and project documentation a4bc605 Prajwal782007 committed 23 days ago
fix: update health check endpoint in GridMind notebook and provide utility script to apply fix 18750f8 Prajwal782007 committed 23 days ago
refactor: replace heuristic log generation with Go-based environment simulation and update API schema 3b977fc adityss committed 23 days ago
Add coordinator endpoint tests and project readiness verification script 88da572 adityss committed 23 days ago