Commit History

feat: implement Unsloth GRPO training script with diverse reward functions and logging
d2449aa

adityss committed on

Merge branch 'main' of https://github.com/LO-Kyu/gridmind
c2917d2

ShreeshantXD committed on

remove binary from tracking
5713634

ShreeshantXD committed on

feat: enable huggingface-skills plugin in cursor settings
e531486

adityss committed on

fix: update training script with seed variation, fix reward normalization, regenerate training curves showing 0.52->0.67 improvement
bdc9954

adityss committed on

feat: add scripts/full_demo.py — unified 10-step demo proving all 4 hackathon themes operational
5636c9d

adityss committed on

docs: add HF blog post draft for community posting
2256ed6

adityss committed on

fix: training reward uses 8-step rollout + /grade for genuine episode-level signal
c70e17d

adityss committed on

feat: commit training evidence, update README with real scores, add demo scripts
8204dc0

adityss committed on

feat: add baseline evaluation tools and demo scripts for RL performance comparison
c395f6a

adityss committed on

chore: ignore ML outputs, models, and AI agent artifacts
e517002

adityss committed on

fix: include env/faults.go missed due to .gitignore
61fd4d1

adityss committed on

fix: add missing faults.go file causing HF space build error
6785dd4

adityss committed on

feat: add GridMind GRPO training notebook using Unsloth and HF TRL
bdadba1

adityss committed on

feat: implement Go-based GridMind-RL simulation core and update inference interface (graph)
a4671c4

Prajwal782007 committed on

docs: add reward weight rationale table to README
74dc7b5

adityss committed on

feat: add multi-agent and planning CLI flags to inference and expose environment metadata via /info endpoint
ebe8fa5

adityss committed on

feat: define GridMind-RL environment data models and task structures
c009bc5

adityss committed on

feat: implement multi-component dense reward function and environmental logic for GridMind-RL
b81683f

adityss committed on

Add Task 4 instruction following, Curriculum Manager for self-improvement, and world modeling simulation
0af208b

adityss committed on

Add GridMind GRPO training notebook for Colab
fd2ceda

ShreeshantXD committed on

fix: Replace ineffective break with return in WebSocket close handler
d012f99

ShreeshantXD committed on

feat: Add full OpenEnv compliance
4ec4472

ShreeshantXD committed on

refactor: Inference and readme
f020509

ShreeshantXD committed on

rewrite reward system
32565e1

ShreeshantXD committed on

fix llm based reward output
2ede269

ShreeshantXD committed on

fix: problems with parameters
f316664

ShreeshantXD committed on

fix: introduce SCORE_EPSILON and clamp scores in run_episode and main functions
b93cee3

adityss committed on

fix: clamp scores after rounding and ensure all sub-scores are clamped
e58b5ec

ShreeshantXD committed on

fix: clamp all scores to open interval (0, 1) to meet validator requirements
ef0556b

ShreeshantXD committed on

fix: mirror all base images via gcr.io
db76306

ShreeshantXD committed on

fix: use gcr.io mirror to avoid Docker Hub rate limits
7e27f89

ShreeshantXD committed on

fix: use golang:1.21 instead of alpine for better Docker registry compatibility
287d2a3

ShreeshantXD committed on

fix: auto-start env server in inference.py
30aa472

ShreeshantXD committed on

refactor: update default model and API endpoint, enhance error handling, and add close method for compatibility
891cc5b

adityss committed on

Merge branch 'origin/main' with local changes overriding
fd3e9b6

adityss committed on

fix: provide fallback API key and add safety check for empty observations in inference client
fe2f8c9

adityss committed on

Sync root inference.py with fixed python/inference.py
588b24a

ShreeshantXD committed on

Fix inference.py: handle missing API key gracefully, wrap all exceptions
9fd03cb

ShreeshantXD committed on

fix: add 'server' entry point for OpenEnv validator
427e52b

ShreeshantXD committed on

fix: add server entry point and pyproject scripts for OpenEnv validator
91cc891

ShreeshantXD committed on

Merge branch 'main' of https://github.com/LO-Kyu/gridmind
0d301ce

adityss committed on

Fix UTF-8 encoding for landing page emojis
f75687e

ShreeshantXD committed on

Fix GitHub URL in landing page
1838d55

ShreeshantXD committed on

Fix supervisord permissions for HF Spaces non-root user
13a69d0

ShreeshantXD committed on

Add root landing page handler with links to dashboard and API endpoints
90c0e10

ShreeshantXD committed on

Fix dashboard paths for /dashboard reverse proxy
832f069

ShreeshantXD committed on

Add dashboard reverse proxy at /dashboard
84fb786

ShreeshantXD committed on