Commit History

docs: rewrite and expand blog post describing GridMind-RL environment architecture and results
ea1e214

Prajwal782007 committed on

Please provide the file changes or a description of the code modifications so I can generate the commit message for you.
3857109

Prajwal782007 committed on

Final cleanup: condense blog post and align README with hackathon judging criteria
79d1279

Prajwal782007 committed on

docs: update HF_BLOG_POST.md with OpenEnv interface details, reward engineering insights, and refined training results
fb9da93

Prajwal782007 committed on

Updated blog and renamed curves
51aa9cd

ShreeshantXD committed on

Merge branch 'main' of https://github.com/LO-Kyu/gridmind
70d6600

Prajwal782007 committed on

docs: remove links section from main.go HTML response
2f90681

Prajwal782007 committed on

Merge branch 'main' of https://github.com/LO-Kyu/gridmind
ddca261

adityss committed on

feat: add training curve images for iteration 10
4f56bad

adityss committed on

fix: remove broken OpenEnv Specification link from documentation footer
a8aa990

Prajwal782007 committed on

Updated README for Round 2
52635ef

ShreeshantXD committed on

docs: add training baseline, reward, and loss curve plots for multiple experimental runs
28abef0

adityss committed on

feat: add GridMind-RL GRPO training notebook for industrial energy management
87ce30f

adityss committed on

Merge branch 'main' of https://github.com/LO-Kyu/gridmind
8b17b2a

Prajwal782007 committed on

feat: make GitHub link open in new tab
8078e50

Prajwal782007 committed on

feat: make Google Colab link open in new tab
8240ff4

Prajwal782007 committed on

feat: add Google Colab link to dashboard
b39d65e

Prajwal782007 committed on

feat: implement Unsloth GRPO training pipeline with environment-backed reward functions and balanced dataset generation
27d3504

adityss committed on

fix: disable AMP for quantized models to avoid gradient scaler issues in GRPO training
19ba2eb

adityss committed on

feat: update GRPO training configuration with additional parameters for logging and precision
f3ecc94

adityss committed on

Adjust logging configuration for training: log every step, enable completion metrics, and limit completions printed per step.
a6b45e9

adityss committed on

feat: add GridMind GRPO training notebook for multi-theme reinforcement learning
999605c

Prajwal782007 committed on

feat: add GRPO training notebook for GridMind-RL environment
505323f

Prajwal782007 committed on

feat: add GridMind GRPO training notebook for Meta PyTorch OpenEnv hackathon
29b9cd0

Prajwal782007 committed on

feat: add GridMind GRPO training notebook for industrial energy management environment
9d42d14

Prajwal782007 committed on

feat: implement GridMind-RL training pipeline with GRPO Colab notebook and Unsloth configuration script
b0701ef

Prajwal782007 committed on

feat: implement Unsloth GRPO training script with environment-based reward tracking and balanced dataset generation
32d5b8f

Prajwal782007 committed on

feat: add GRPO training pipeline for GridMind-RL environment via Unsloth and TRL
26e9b86

Prajwal782007 committed on

feat: add submission validator script and GRPO training notebook, and update Python version requirement to >=3.10
7d89faf

Prajwal782007 committed on

feat: add GridMind GRPO training environment and Unsloth training script
3d49e8a

Prajwal782007 committed on

feat: add script to migrate max_new_tokens from GRPOConfig to GRPOTrainer in notebook
08731ee

Prajwal782007 committed on

fix: change tokenizer to processing_class in GRPOTrainer
acabf6c

Prajwal782007 committed on

fix: order imports in Step 1 and add missing torch import in Step 7
c220c03

Prajwal782007 committed on

fix: add missing sys import in Step 1 cell
84396d2

Prajwal782007 committed on

fix: move max_new_tokens from GRPOConfig to GRPOTrainer generation_kwargs
dc14955

Prajwal782007 committed on

fix: enforce GPU usage and assertions in colab notebook
4738130

Prajwal782007 committed on

feat: add script to patch gridmind_grpo_colab.ipynb with torch import in cell 5e5826e4
f2d83b9

Prajwal782007 committed on

fix: add missing torch import in Step 4 cell
c77108e

Prajwal782007 committed on

feat: add script to automate TRL dependency updates in the GRPO Colab notebook
e9f731a

Prajwal782007 committed on

fix: update dependencies in colab notebook for GRPOTrainer
7597057

Prajwal782007 committed on

fix: install trl from git to get latest GRPOTrainer
a3862f1

Prajwal782007 committed on

chore: add script to update trl dependency version in gridmind_grpo_colab.ipynb
af8efb9

Prajwal782007 committed on

fix: update TRL requirement to support GRPOTrainer
ffe65ee

Prajwal782007 committed on

feat: update HF space URL, add judge demo scripts and project documentation
a4bc605

Prajwal782007 committed on

fix: update health check endpoint in GridMind notebook and provide utility script to apply fix
18750f8

Prajwal782007 committed on

feat: add GRPO training notebook for GridMind-RL environment
e890cbb

adityss committed on

refactor: replace heuristic log generation with Go-based environment simulation and update API schema
3b977fc

adityss committed on

Add coordinator endpoint tests and project readiness verification script
88da572

adityss committed on

feat: enable huggingface-skills plugin in cursor settings
e531486

adityss committed on