Spaces: Prajwal782007/Gridmind

Commit History

docs: rewrite and expand blog post describing GridMind-RL environment architecture and results

ea1e214

Prajwal782007 committed 22 days ago

Please provide the file changes or a description of the code modifications so I can generate the commit message for you.

3857109

Prajwal782007 committed 22 days ago

Final cleanup: condense blog post and align README with hackathon judging criteria

79d1279

Prajwal782007 committed 22 days ago

docs: update HF_BLOG_POST.md with OpenEnv interface details, reward engineering insights, and refined training results

fb9da93

Prajwal782007 committed 22 days ago

updated Blog and renamed curves

51aa9cd

ShreeshantXD committed 22 days ago

Merge branch 'main' of https://github.com/LO-Kyu/gridmind

70d6600

Prajwal782007 committed 22 days ago

docs: remove links section from main.go HTML response

2f90681

Prajwal782007 committed 22 days ago

Merge branch 'main' of https://github.com/LO-Kyu/gridmind

ddca261

adityss committed 22 days ago

feat: add training curve images for iteration 10

4f56bad

adityss committed 22 days ago

fix: remove broken OpenEnv Specification link from documentation footer

a8aa990

Prajwal782007 committed 22 days ago

Updated README for Round 2

52635ef

ShreeshantXD committed 22 days ago

docs: add training baseline, reward, and loss curve plots for multiple experimental runs

28abef0

adityss committed 22 days ago

feat: add GridMind-RL GRPO training notebook for industrial energy management

87ce30f

adityss committed 22 days ago

Merge branch 'main' of https://github.com/LO-Kyu/gridmind

8b17b2a

Prajwal782007 committed 22 days ago

feat: make GitHub link open in new tab

8078e50

Prajwal782007 committed 22 days ago

update: links

9b9f2d1

ShreeshantXD committed 22 days ago

feat: make Google Colab link open in new tab

8240ff4

Prajwal782007 committed 22 days ago

feat: add Google Colab link to dashboard

b39d65e

Prajwal782007 committed 22 days ago

feat: implement Unsloth GRPO training pipeline with environment-backed reward functions and balanced dataset generation

27d3504

adityss committed 23 days ago

fix: disable AMP for quantized models to avoid gradient scaler issues in GRPO training

19ba2eb

adityss committed 23 days ago

feat: update GRPO training configuration with additional parameters for logging and precision

f3ecc94

adityss committed 23 days ago

Adjust logging configuration for training: log every step, enable completion metrics, and limit completions printed per step.

a6b45e9

adityss committed 23 days ago

feat: add GridMind GRPO training notebook for multi-theme reinforcement learning

999605c

Prajwal782007 committed 23 days ago

feat: add GRPO training notebook for GridMind-RL environment

505323f

Prajwal782007 committed 23 days ago

feat: add GridMind GRPO training notebook for Meta PyTorch OpenEnv hackathon

29b9cd0

Prajwal782007 committed 23 days ago

feat: add GridMind GRPO training notebook for industrial energy management environment

9d42d14

Prajwal782007 committed 23 days ago

feat: implement GridMind-RL training pipeline with GRPO Colab notebook and Unsloth configuration script

b0701ef

Prajwal782007 committed 23 days ago

feat: implement Unsloth GRPO training script with environment-based reward tracking and balanced dataset generation

32d5b8f

Prajwal782007 committed 23 days ago

feat: add GRPO training pipeline for GridMind-RL environment via Unsloth and TRL

26e9b86

Prajwal782007 committed 23 days ago

feat: add submission validator script and GRPO training notebook, and update Python version requirement to >=3.10

7d89faf

Prajwal782007 committed 23 days ago

feat: add GridMind GRPO training environment and Unsloth training script

3d49e8a

Prajwal782007 committed 23 days ago

feat: add script to migrate max_new_tokens from GRPOConfig to GRPOTrainer in notebook

08731ee

Prajwal782007 committed 23 days ago

fix: change tokenizer to processing_class in GRPOTrainer

acabf6c

Prajwal782007 committed 23 days ago

fix: order imports in Step 1 and add missing torch import in Step 7

c220c03

Prajwal782007 committed 23 days ago

fix: add missing sys import in Step 1 cell

84396d2

Prajwal782007 committed 23 days ago

fix: move max_new_tokens from GRPOConfig to GRPOTrainer generation_kwargs

dc14955

Prajwal782007 committed 23 days ago

fix: enforce GPU usage and assertions in colab notebook

4738130

Prajwal782007 committed 23 days ago

feat: add script to patch gridmind_grpo_colab.ipynb with torch import in cell 5e5826e4

f2d83b9

Prajwal782007 committed 23 days ago

fix: add missing torch import in Step 4 cell

c77108e

Prajwal782007 committed 23 days ago

feat: add script to automate TRL dependency updates in the GRPO Colab notebook

e9f731a

Prajwal782007 committed 23 days ago

fix: update dependencies in colab notebook for GRPOTrainer

7597057

Prajwal782007 committed 23 days ago

fix: install trl from git to get latest GRPOTrainer

a3862f1

Prajwal782007 committed 23 days ago

chore: add script to update trl dependency version in gridmind_grpo_colab.ipynb

af8efb9

Prajwal782007 committed 23 days ago

fix: update TRL requirement to support GRPOTrainer

ffe65ee

Prajwal782007 committed 23 days ago

feat: update HF space URL, add judge demo scripts and project documentation

a4bc605

Prajwal782007 committed 23 days ago

fix: update health check endpoint in GridMind notebook and provide utility script to apply fix

18750f8

Prajwal782007 committed 23 days ago

feat: add GRPO training notebook for GridMind-RL environment

e890cbb

adityss committed 23 days ago

refactor: replace heuristic log generation with Go-based environment simulation and update API schema

3b977fc

adityss committed 23 days ago

Add coordinator endpoint tests and project readiness verification script

88da572

adityss committed 23 days ago

feat: enable huggingface-skills plugin in cursor settings

e531486

adityss committed 23 days ago