Spaces:

Prajwal782007
/

Gridmind

Running

App Files Files Community

Gridmind

Commit History

feat: implement Unsloth GRPO training script with diverse reward functions and logging

d2449aa

adityss commited on Apr 25

Merge branch 'main' of https://github.com/LO-Kyu/gridmind

c2917d2

ShreeshantXD commited on Apr 25

remove binary from tracking

5713634

ShreeshantXD commited on Apr 25

feat: enable huggingface-skills plugin in cursor settings

e531486

adityss commited on Apr 25

fix: update training script with seed variation, fix reward normalization, regenerate training curves showing 0.52->0.67 improvement

bdc9954

adityss commited on Apr 25

feat: add scripts/full_demo.py — unified 10-step demo proving all 4 hackathon themes operational

5636c9d

adityss commited on Apr 25

docs: add HF blog post draft for community posting

2256ed6

adityss commited on Apr 25

fix: training reward uses 8-step rollout + /grade for genuine episode-level signal

c70e17d

adityss commited on Apr 25

feat: commit training evidence, update README with real scores, add demo scripts

8204dc0

adityss commited on Apr 25

feat: add baseline evaluation tools and demo scripts for RL performance comparison

c395f6a

adityss commited on Apr 25

chore: ignore ML outputs, models, and AI agent artifacts

e517002

adityss commited on Apr 25

fix: include env/faults.go missed due to .gitignore

61fd4d1

adityss commited on Apr 25

fix: add missing faults.go file causing HF space build error

6785dd4

adityss commited on Apr 25

feat: add GridMind GRPO training notebook using Unsloth and HF TRL

bdadba1

adityss commited on Apr 25

feat: implement Go-based GridMind-RL simulation core and update inference interface (graph)

a4671c4

Prajwal782007 commited on Apr 25

docs: add reward weight rationale table to README

74dc7b5

adityss commited on Apr 25

feat: add multi-agent and planning CLI flags to inference and expose environment metadata via /info endpoint

ebe8fa5

adityss commited on Apr 25

feat: define GridMind-RL environment data models and task structures

c009bc5

adityss commited on Apr 25

feat: implement multi-component dense reward function and environmental logic for GridMind-RL

b81683f

adityss commited on Apr 25

Add Task 4 instruction following, Curriculum Manager for self-improvement, and world modeling simulation

0af208b

adityss commited on Apr 22

Add GridMind GRPO training notebook for Colab

fd2ceda

ShreeshantXD commited on Apr 22

fix: Replace ineffective break with return in WebSocket close handler

d012f99

ShreeshantXD commited on Apr 21

feat: Add full OpenEnv compliance

4ec4472

ShreeshantXD commited on Apr 21

refactor: Inference and readme

f020509

ShreeshantXD commited on Apr 8

updated readme

a4be35d

ShreeshantXD commited on Apr 8

rewrite reward system

32565e1

ShreeshantXD commited on Apr 8

fix llm based reward output

2ede269

ShreeshantXD commited on Apr 8

fix: problems with parameters

f316664

ShreeshantXD commited on Apr 8

fix: reward system

8242dc3

ShreeshantXD commited on Apr 8

fix: introduce SCORE_EPSILON and clamp scores in run_episode and main functions

b93cee3

adityss commited on Apr 7

fix: clamp scores after rounding and ensure all sub-scores are clamped

e58b5ec

ShreeshantXD commited on Apr 7

fix: clamp all scores to open interval (0, 1) to meet validator requirements

ef0556b

ShreeshantXD commited on Apr 7

fix: mirror all base images via gcr.io

db76306

ShreeshantXD commited on Apr 7

fix: use gcr.io mirror to avoid Docker Hub rate limits

7e27f89

ShreeshantXD commited on Apr 7

fix: use golang:1.21 instead of alpine for better Docker registry compatibility

287d2a3

ShreeshantXD commited on Apr 7

fix: auto-start env server in inference.py

30aa472

ShreeshantXD commited on Apr 7

refactor: update default model and API endpoint, enhance error handling, and add close method for compatibility

891cc5b

adityss commited on Apr 7

Merge branch 'origin/main' with local changes overriding

fd3e9b6

adityss commited on Apr 7

fix: provide fallback API key and add safety check for empty observations in inference client

fe2f8c9

adityss commited on Apr 7

Sync root inference.py with fixed python/inference.py

588b24a

ShreeshantXD commited on Apr 7

Fix inference.py: handle missing API key gracefully, wrap all exceptions

9fd03cb

ShreeshantXD commited on Apr 7

fix: add 'server' entry point for OpenEnv validator

427e52b

ShreeshantXD commited on Apr 6

fix: add server entry point and pyproject scripts for OpenEnv validator

91cc891

ShreeshantXD commited on Apr 6

Merge branch 'main' of https://github.com/LO-Kyu/gridmind

0d301ce

adityss commited on Apr 5

Fix UTF-8 encoding for landing page emojis

f75687e

ShreeshantXD commited on Apr 5

Fix GitHub URL in landing page

1838d55

ShreeshantXD commited on Apr 5

Fix supervisord permissions for HF Spaces non-root user

13a69d0

ShreeshantXD commited on Apr 5

Add root landing page handler with links to dashboard and API endpoints

90c0e10

ShreeshantXD commited on Apr 5

Fix dashboard paths for /dashboard reverse proxy

832f069

ShreeshantXD commited on Apr 5

Add dashboard reverse proxy at /dashboard

84fb786

ShreeshantXD commited on Apr 5

Commit History

feat: implement Unsloth GRPO training script with diverse reward functions and logging d2449aa

Merge branch 'main' of https://github.com/LO-Kyu/gridmind c2917d2

remove binary from tracking 5713634

feat: enable huggingface-skills plugin in cursor settings e531486

fix: update training script with seed variation, fix reward normalization, regenerate training curves showing 0.52->0.67 improvement bdc9954

feat: add scripts/full_demo.py — unified 10-step demo proving all 4 hackathon themes operational 5636c9d

docs: add HF blog post draft for community posting 2256ed6

fix: training reward uses 8-step rollout + /grade for genuine episode-level signal c70e17d

feat: commit training evidence, update README with real scores, add demo scripts 8204dc0

feat: add baseline evaluation tools and demo scripts for RL performance comparison c395f6a

chore: ignore ML outputs, models, and AI agent artifacts e517002

fix: include env/faults.go missed due to .gitignore 61fd4d1

fix: add missing faults.go file causing HF space build error 6785dd4

feat: add GridMind GRPO training notebook using Unsloth and HF TRL bdadba1

feat: implement Go-based GridMind-RL simulation core and update inference interface (graph) a4671c4

docs: add reward weight rationale table to README 74dc7b5

feat: add multi-agent and planning CLI flags to inference and expose environment metadata via /info endpoint ebe8fa5

feat: define GridMind-RL environment data models and task structures c009bc5

feat: implement multi-component dense reward function and environmental logic for GridMind-RL b81683f

Add Task 4 instruction following, Curriculum Manager for self-improvement, and world modeling simulation 0af208b

Add GridMind GRPO training notebook for Colab fd2ceda

fix: Replace ineffective break with return in WebSocket close handler d012f99

feat: Add full OpenEnv compliance 4ec4472

refactor: Inference and readme f020509

updated readme a4be35d

rewrite reward system 32565e1

fix llm based reward output 2ede269

fix: problems with parameters f316664

fix: reward system 8242dc3

fix: introduce SCORE_EPSILON and clamp scores in run_episode and main functions b93cee3

fix: clamp scores after rounding and ensure all sub-scores are clamped e58b5ec

fix: clamp all scores to open interval (0, 1) to meet validator requirements ef0556b

fix: mirror all base images via gcr.io db76306

fix: use gcr.io mirror to avoid Docker Hub rate limits 7e27f89

fix: use golang:1.21 instead of alpine for better Docker registry compatibility 287d2a3

fix: auto-start env server in inference.py 30aa472

refactor: update default model and API endpoint, enhance error handling, and add close method for compatibility 891cc5b

Merge branch 'origin/main' with local changes overriding fd3e9b6

fix: provide fallback API key and add safety check for empty observations in inference client fe2f8c9

Sync root inference.py with fixed python/inference.py 588b24a

Fix inference.py: handle missing API key gracefully, wrap all exceptions 9fd03cb

fix: add 'server' entry point for OpenEnv validator 427e52b

fix: add server entry point and pyproject scripts for OpenEnv validator 91cc891

Merge branch 'main' of https://github.com/LO-Kyu/gridmind 0d301ce

Fix UTF-8 encoding for landing page emojis f75687e

Fix GitHub URL in landing page 1838d55

Fix supervisord permissions for HF Spaces non-root user 13a69d0

Add root landing page handler with links to dashboard and API endpoints 90c0e10

Fix dashboard paths for /dashboard reverse proxy 832f069

Add dashboard reverse proxy at /dashboard 84fb786

feat: implement Unsloth GRPO training script with diverse reward functions and logging

d2449aa

Merge branch 'main' of https://github.com/LO-Kyu/gridmind

c2917d2

remove binary from tracking

5713634

feat: enable huggingface-skills plugin in cursor settings

e531486

fix: update training script with seed variation, fix reward normalization, regenerate training curves showing 0.52->0.67 improvement

bdc9954

feat: add scripts/full_demo.py — unified 10-step demo proving all 4 hackathon themes operational

5636c9d

docs: add HF blog post draft for community posting

2256ed6

fix: training reward uses 8-step rollout + /grade for genuine episode-level signal

c70e17d

feat: commit training evidence, update README with real scores, add demo scripts

8204dc0

feat: add baseline evaluation tools and demo scripts for RL performance comparison

c395f6a

chore: ignore ML outputs, models, and AI agent artifacts

e517002

fix: include env/faults.go missed due to .gitignore

61fd4d1

fix: add missing faults.go file causing HF space build error

6785dd4

feat: add GridMind GRPO training notebook using Unsloth and HF TRL

bdadba1

feat: implement Go-based GridMind-RL simulation core and update inference interface (graph)

a4671c4

docs: add reward weight rationale table to README

74dc7b5

feat: add multi-agent and planning CLI flags to inference and expose environment metadata via /info endpoint

ebe8fa5

feat: define GridMind-RL environment data models and task structures

c009bc5

feat: implement multi-component dense reward function and environmental logic for GridMind-RL

b81683f

Add Task 4 instruction following, Curriculum Manager for self-improvement, and world modeling simulation

0af208b

Add GridMind GRPO training notebook for Colab

fd2ceda

fix: Replace ineffective break with return in WebSocket close handler

d012f99

feat: Add full OpenEnv compliance

4ec4472

refactor: Inference and readme

f020509

updated readme

a4be35d

rewrite reward system

32565e1

fix llm based reward output

2ede269

fix: problems with parameters

f316664

fix: reward system

8242dc3

fix: introduce SCORE_EPSILON and clamp scores in run_episode and main functions

b93cee3

fix: clamp scores after rounding and ensure all sub-scores are clamped

e58b5ec

fix: clamp all scores to open interval (0, 1) to meet validator requirements

ef0556b

fix: mirror all base images via gcr.io

db76306

fix: use gcr.io mirror to avoid Docker Hub rate limits

7e27f89

fix: use golang:1.21 instead of alpine for better Docker registry compatibility

287d2a3

fix: auto-start env server in inference.py

30aa472

refactor: update default model and API endpoint, enhance error handling, and add close method for compatibility

891cc5b

Merge branch 'origin/main' with local changes overriding

fd3e9b6

fix: provide fallback API key and add safety check for empty observations in inference client

fe2f8c9

Sync root inference.py with fixed python/inference.py

588b24a

Fix inference.py: handle missing API key gracefully, wrap all exceptions

9fd03cb

fix: add 'server' entry point for OpenEnv validator

427e52b

fix: add server entry point and pyproject scripts for OpenEnv validator

91cc891

Merge branch 'main' of https://github.com/LO-Kyu/gridmind

0d301ce

Fix UTF-8 encoding for landing page emojis

f75687e

Fix GitHub URL in landing page

1838d55

Fix supervisord permissions for HF Spaces non-root user

13a69d0

Add root landing page handler with links to dashboard and API endpoints

90c0e10

Fix dashboard paths for /dashboard reverse proxy

832f069

Add dashboard reverse proxy at /dashboard

84fb786