Spaces:

ademarteau
/

RL-Inventory-Simulations

Runtime error

App Files Files Community

RL-Inventory-Simulations

Commit History

chore: untrack server/pycache files

4ed16c1

ademarteau commited on Mar 8

fix: use correct HF router URL /hf-inference/v1/ (not /models/{id}/v1/)

9a9473a

ademarteau commited on Mar 8

fix: COGS profit model - charge unit cost on sold units not on orders, preventing end-of-period bias against high-inventory strategies

18aac4f

ademarteau commited on Mar 8

fix: update HF proxy URL to router.huggingface.co (api-inference deprecated 410)

38aa642

ademarteau commited on Mar 8

fix: serve index.html with no-cache headers to prevent CDN/browser stale cache

2cee429

ademarteau commited on Mar 8

fix: proxy HF Inference API through FastAPI to bypass HF Spaces CSP

c3fc8d4

ademarteau commited on Mar 8

metrics: profit first, then service level, then fill rate (React UI)

e7f1f53

ademarteau commited on Mar 8

fix: use minimal requirements-server.txt in Docker to avoid downloading torch/ML deps

b2065cc

ademarteau commited on Mar 8

fix: add package-lock.json for npm ci in Docker build

0209151

ademarteau commited on Mar 8

fix: switch HF Space SDK from gradio to docker

4992459

ademarteau commited on Mar 8

feat: replace Gradio with React UI — GRPO tab, 730-day sim, 200-entry memory bank

b413222

ademarteau commited on Mar 8

refactor: remove Unsloth, use standard transformers + PEFT

355b2d5

RishbhaJain Claude Sonnet 4.6 commited on Mar 8

fix: use nvidia/cuda devel base image so vllm can build with CUDA toolkit

b52921e

Arvind Sreenivas commited on Mar 8

fix: pin torch 2.6.0 + xformers 0.0.29, use Python 3.12 for ML compat

d73e520

Arvind Sreenivas commited on Mar 8

fix: install torch, xformers, vllm before requirements to avoid build failures

ea9e7b6

Arvind Sreenivas commited on Mar 8

fix: install torch before vllm/xformers to satisfy build deps

32d0699

Arvind Sreenivas commited on Mar 8

feat: add vllm for Unsloth fast_inference

a600022

Arvind Sreenivas commited on Mar 8

fix: force-reinstall unsloth to fix PreTrainedConfig NameError

477d0a5

Arvind Sreenivas commited on Mar 8

fix: align Unsloth config with recommended GRPO settings

d1c6fd5

RishbhaJain Claude Sonnet 4.6 commited on Mar 8

Remove ideas

7519b6d

ademarteau commited on Mar 8

metrics: profit first, then service level, then fill rate

39193b5

ademarteau commited on Mar 8

Merge branch 'main' of https://github.com/ademcodesproducts/OpenEnv-Inventory-Simulations

84565ee

ademarteau commited on Mar 8

fix: pipeline-aware ordering, YoY demand signal, reward rebalancing

c10dcd0

RishbhaJain Claude Sonnet 4.6 commited on Mar 8

feat: integrate Unsloth into GRPO training pipeline

4d42a14

RishbhaJain Claude Sonnet 4.6 commited on Mar 8

feat: full-horizon lookahead reward (365 days, <0.5ms)

af5c3c7

Arvind Sreenivas commited on Mar 8

feat: crash-resilient training with dataset caching and iteration resume

9ebd26d

Arvind Sreenivas commited on Mar 8

feat: tee training logs to persistent volume

7658153

Arvind Sreenivas commited on Mar 8

Remove PPO MLP agent, update README

e21ed94

ademarteau commited on Mar 8

feat: add Northflank training Dockerfile and start.sh

c0ce96d

Arvind Sreenivas commited on Mar 8

feat: improve GRPO training logging and fix torch_dtype deprecation

7dea3a9

Arvind Sreenivas commited on Mar 8

fix: let Gradio auto-select port locally, fix via env vars only

6d9b0d9

ademarteau commited on Mar 8

fixed all environment files

56146fe

ademarteau commited on Mar 8

fixed all environment files

ff33a83

ademarteau commited on Mar 8

fix: use 127.0.0.1 locally, 0.0.0.0 in Docker via env var

b7d695d

ademarteau commited on Mar 8

fix: Dockerfile runs Gradio app, not FastAPI server

b5c037f

ademarteau commited on Mar 8

Merge branch 'main' of https://huggingface.co/spaces/ademarteau/RL-Inventory-Simulations

7f56785

ademarteau commited on Mar 8

fix: fall back to HF_TOKEN env var for Inference API auth

f8a94b0

ademarteau commited on Mar 8

Upload ppo_inventory.zip with huggingface_hub

ff80aea
verified

ademarteau commited on Mar 8

Merge: accept remote ppo model and gitattributes

ca85d91

ademarteau commited on Mar 8

Upload ppo_inventory.zip with huggingface_hub

c865376

ademarteau commited on Mar 8

Track zip files with LFS

cc5ff9b

ademarteau commited on Mar 8

Merge teammate changes, unify reward via reward.py, add PPO model

043e4e9

ademarteau commited on Mar 8

Added PPO model and reward.py

7ed1454

ademarteau commited on Mar 8

feat: improve training logging with tqdm, timings, GPU memory, ETA

766dc8c

Arvind Sreenivas commited on Mar 8

Merge branch 'main' of https://github.com/ademcodesproducts/OpenEnv-Inventory-Simulations

920573d

ademarteau commited on Mar 8

Added trained PPO model + app.py UI changes for HF Spaces

3cad082

ademarteau commited on Mar 8

fix: add missing ML and simulation packages to requirements.txt

2344156

Arvind Sreenivas commited on Mar 8

fix: remove pywin32 Windows-only packages, use Python 3.13

1091939

Arvind Sreenivas commited on Mar 8

fix: bump to Python 3.13 to match requirements.txt (audioop-lts)

5482efa

Arvind Sreenivas commited on Mar 8

fix: remove scripts/ COPY, scripts dir not in repo

dd1dcec

Arvind Sreenivas commited on Mar 8

Commit History

chore: untrack server/__pycache__ files 4ed16c1

fix: use correct HF router URL /hf-inference/v1/ (not /models/{id}/v1/) 9a9473a

fix: COGS profit model - charge unit cost on sold units not on orders, preventing end-of-period bias against high-inventory strategies 18aac4f

fix: update HF proxy URL to router.huggingface.co (api-inference deprecated 410) 38aa642

fix: serve index.html with no-cache headers to prevent CDN/browser stale cache 2cee429

fix: proxy HF Inference API through FastAPI to bypass HF Spaces CSP c3fc8d4

metrics: profit first, then service level, then fill rate (React UI) e7f1f53

fix: use minimal requirements-server.txt in Docker to avoid downloading torch/ML deps b2065cc

fix: add package-lock.json for npm ci in Docker build 0209151

fix: switch HF Space SDK from gradio to docker 4992459

feat: replace Gradio with React UI — GRPO tab, 730-day sim, 200-entry memory bank b413222

refactor: remove Unsloth, use standard transformers + PEFT 355b2d5

fix: use nvidia/cuda devel base image so vllm can build with CUDA toolkit b52921e

fix: pin torch 2.6.0 + xformers 0.0.29, use Python 3.12 for ML compat d73e520

fix: install torch, xformers, vllm before requirements to avoid build failures ea9e7b6

fix: install torch before vllm/xformers to satisfy build deps 32d0699

feat: add vllm for Unsloth fast_inference a600022

fix: force-reinstall unsloth to fix PreTrainedConfig NameError 477d0a5

fix: align Unsloth config with recommended GRPO settings d1c6fd5

Remove ideas 7519b6d

metrics: profit first, then service level, then fill rate 39193b5

Merge branch 'main' of https://github.com/ademcodesproducts/OpenEnv-Inventory-Simulations 84565ee

fix: pipeline-aware ordering, YoY demand signal, reward rebalancing c10dcd0

feat: integrate Unsloth into GRPO training pipeline 4d42a14

feat: full-horizon lookahead reward (365 days, <0.5ms) af5c3c7

feat: crash-resilient training with dataset caching and iteration resume 9ebd26d

feat: tee training logs to persistent volume 7658153

Remove PPO MLP agent, update README e21ed94

feat: add Northflank training Dockerfile and start.sh c0ce96d

feat: improve GRPO training logging and fix torch_dtype deprecation 7dea3a9

fix: let Gradio auto-select port locally, fix via env vars only 6d9b0d9

fixed all environment files 56146fe

fixed all environment files ff33a83

fix: use 127.0.0.1 locally, 0.0.0.0 in Docker via env var b7d695d

fix: Dockerfile runs Gradio app, not FastAPI server b5c037f

Merge branch 'main' of https://huggingface.co/spaces/ademarteau/RL-Inventory-Simulations 7f56785

fix: fall back to HF_TOKEN env var for Inference API auth f8a94b0

Upload ppo_inventory.zip with huggingface_hub ff80aea verified

Merge: accept remote ppo model and gitattributes ca85d91

Upload ppo_inventory.zip with huggingface_hub c865376

Track zip files with LFS cc5ff9b

Merge teammate changes, unify reward via reward.py, add PPO model 043e4e9

Added PPO model and reward.py 7ed1454

feat: improve training logging with tqdm, timings, GPU memory, ETA 766dc8c

Merge branch 'main' of https://github.com/ademcodesproducts/OpenEnv-Inventory-Simulations 920573d

Added trained PPO model + app.py UI changes for HF Spaces 3cad082

fix: add missing ML and simulation packages to requirements.txt 2344156

fix: remove pywin32 Windows-only packages, use Python 3.13 1091939

fix: bump to Python 3.13 to match requirements.txt (audioop-lts) 5482efa

fix: remove scripts/ COPY, scripts dir not in repo dd1dcec

chore: untrack server/pycache files

4ed16c1

fix: use correct HF router URL /hf-inference/v1/ (not /models/{id}/v1/)

9a9473a

fix: COGS profit model - charge unit cost on sold units not on orders, preventing end-of-period bias against high-inventory strategies

18aac4f

fix: update HF proxy URL to router.huggingface.co (api-inference deprecated 410)

38aa642

fix: serve index.html with no-cache headers to prevent CDN/browser stale cache

2cee429

fix: proxy HF Inference API through FastAPI to bypass HF Spaces CSP

c3fc8d4

metrics: profit first, then service level, then fill rate (React UI)

e7f1f53

fix: use minimal requirements-server.txt in Docker to avoid downloading torch/ML deps

b2065cc

fix: add package-lock.json for npm ci in Docker build

0209151

fix: switch HF Space SDK from gradio to docker

4992459

feat: replace Gradio with React UI — GRPO tab, 730-day sim, 200-entry memory bank

b413222

refactor: remove Unsloth, use standard transformers + PEFT

355b2d5

fix: use nvidia/cuda devel base image so vllm can build with CUDA toolkit

b52921e

fix: pin torch 2.6.0 + xformers 0.0.29, use Python 3.12 for ML compat

d73e520

fix: install torch, xformers, vllm before requirements to avoid build failures

ea9e7b6

fix: install torch before vllm/xformers to satisfy build deps

32d0699

feat: add vllm for Unsloth fast_inference

a600022

fix: force-reinstall unsloth to fix PreTrainedConfig NameError

477d0a5

fix: align Unsloth config with recommended GRPO settings

d1c6fd5

Remove ideas

7519b6d

metrics: profit first, then service level, then fill rate

39193b5

Merge branch 'main' of https://github.com/ademcodesproducts/OpenEnv-Inventory-Simulations

84565ee

fix: pipeline-aware ordering, YoY demand signal, reward rebalancing

c10dcd0

feat: integrate Unsloth into GRPO training pipeline

4d42a14

feat: full-horizon lookahead reward (365 days, <0.5ms)

af5c3c7

feat: crash-resilient training with dataset caching and iteration resume

9ebd26d

feat: tee training logs to persistent volume

7658153

Remove PPO MLP agent, update README

e21ed94

feat: add Northflank training Dockerfile and start.sh

c0ce96d

feat: improve GRPO training logging and fix torch_dtype deprecation

7dea3a9

fix: let Gradio auto-select port locally, fix via env vars only

6d9b0d9

fixed all environment files

56146fe

fixed all environment files

ff33a83

fix: use 127.0.0.1 locally, 0.0.0.0 in Docker via env var

b7d695d

fix: Dockerfile runs Gradio app, not FastAPI server

b5c037f

Merge branch 'main' of https://huggingface.co/spaces/ademarteau/RL-Inventory-Simulations

7f56785

fix: fall back to HF_TOKEN env var for Inference API auth

f8a94b0

Upload ppo_inventory.zip with huggingface_hub

ff80aea
verified

Merge: accept remote ppo model and gitattributes

ca85d91

Upload ppo_inventory.zip with huggingface_hub

c865376

Track zip files with LFS

cc5ff9b

Merge teammate changes, unify reward via reward.py, add PPO model

043e4e9

Added PPO model and reward.py

7ed1454

feat: improve training logging with tqdm, timings, GPU memory, ETA

766dc8c

Merge branch 'main' of https://github.com/ademcodesproducts/OpenEnv-Inventory-Simulations

920573d

Added trained PPO model + app.py UI changes for HF Spaces

3cad082

fix: add missing ML and simulation packages to requirements.txt

2344156

fix: remove pywin32 Windows-only packages, use Python 3.13

1091939

fix: bump to Python 3.13 to match requirements.txt (audioop-lts)

5482efa

fix: remove scripts/ COPY, scripts dir not in repo

dd1dcec