Commit History

chore: untrack server/__pycache__ files
4ed16c1

ademarteau commited on

fix: use correct HF router URL /hf-inference/v1/ (not /models/{id}/v1/)
9a9473a

ademarteau commited on

fix: COGS profit model - charge unit cost on sold units not on orders, preventing end-of-period bias against high-inventory strategies
18aac4f

ademarteau commited on

fix: update HF proxy URL to router.huggingface.co (api-inference deprecated 410)
38aa642

ademarteau commited on

fix: serve index.html with no-cache headers to prevent CDN/browser stale cache
2cee429

ademarteau commited on

fix: proxy HF Inference API through FastAPI to bypass HF Spaces CSP
c3fc8d4

ademarteau commited on

metrics: profit first, then service level, then fill rate (React UI)
e7f1f53

ademarteau commited on

fix: use minimal requirements-server.txt in Docker to avoid downloading torch/ML deps
b2065cc

ademarteau commited on

fix: add package-lock.json for npm ci in Docker build
0209151

ademarteau commited on

fix: switch HF Space SDK from gradio to docker
4992459

ademarteau commited on

feat: replace Gradio with React UI — GRPO tab, 730-day sim, 200-entry memory bank
b413222

ademarteau commited on

refactor: remove Unsloth, use standard transformers + PEFT
355b2d5

RishbhaJain Claude Sonnet 4.6 commited on

fix: use nvidia/cuda devel base image so vllm can build with CUDA toolkit
b52921e

Arvind Sreenivas commited on

fix: pin torch 2.6.0 + xformers 0.0.29, use Python 3.12 for ML compat
d73e520

Arvind Sreenivas commited on

fix: install torch, xformers, vllm before requirements to avoid build failures
ea9e7b6

Arvind Sreenivas commited on

fix: install torch before vllm/xformers to satisfy build deps
32d0699

Arvind Sreenivas commited on

feat: add vllm for Unsloth fast_inference
a600022

Arvind Sreenivas commited on

fix: force-reinstall unsloth to fix PreTrainedConfig NameError
477d0a5

Arvind Sreenivas commited on

fix: align Unsloth config with recommended GRPO settings
d1c6fd5

RishbhaJain Claude Sonnet 4.6 commited on

Remove ideas
7519b6d

ademarteau commited on

metrics: profit first, then service level, then fill rate
39193b5

ademarteau commited on

Merge branch 'main' of https://github.com/ademcodesproducts/OpenEnv-Inventory-Simulations
84565ee

ademarteau commited on

fix: pipeline-aware ordering, YoY demand signal, reward rebalancing
c10dcd0

RishbhaJain Claude Sonnet 4.6 commited on

feat: integrate Unsloth into GRPO training pipeline
4d42a14

RishbhaJain Claude Sonnet 4.6 commited on

feat: full-horizon lookahead reward (365 days, <0.5ms)
af5c3c7

Arvind Sreenivas commited on

feat: crash-resilient training with dataset caching and iteration resume
9ebd26d

Arvind Sreenivas commited on

feat: tee training logs to persistent volume
7658153

Arvind Sreenivas commited on

Remove PPO MLP agent, update README
e21ed94

ademarteau commited on

feat: add Northflank training Dockerfile and start.sh
c0ce96d

Arvind Sreenivas commited on

feat: improve GRPO training logging and fix torch_dtype deprecation
7dea3a9

Arvind Sreenivas commited on

fix: let Gradio auto-select port locally, fix via env vars only
6d9b0d9

ademarteau commited on

fixed all environment files
56146fe

ademarteau commited on

fixed all environment files
ff33a83

ademarteau commited on

fix: use 127.0.0.1 locally, 0.0.0.0 in Docker via env var
b7d695d

ademarteau commited on

fix: Dockerfile runs Gradio app, not FastAPI server
b5c037f

ademarteau commited on

Merge branch 'main' of https://huggingface.co/spaces/ademarteau/RL-Inventory-Simulations
7f56785

ademarteau commited on

fix: fall back to HF_TOKEN env var for Inference API auth
f8a94b0

ademarteau commited on

Upload ppo_inventory.zip with huggingface_hub
ff80aea
verified

ademarteau commited on

Merge: accept remote ppo model and gitattributes
ca85d91

ademarteau commited on

Upload ppo_inventory.zip with huggingface_hub
c865376

ademarteau commited on

Track zip files with LFS
cc5ff9b

ademarteau commited on

Merge teammate changes, unify reward via reward.py, add PPO model
043e4e9

ademarteau commited on

Added PPO model and reward.py
7ed1454

ademarteau commited on

feat: improve training logging with tqdm, timings, GPU memory, ETA
766dc8c

Arvind Sreenivas commited on

Merge branch 'main' of https://github.com/ademcodesproducts/OpenEnv-Inventory-Simulations
920573d

ademarteau commited on

Added trained PPO model + app.py UI changes for HF Spaces
3cad082

ademarteau commited on

fix: add missing ML and simulation packages to requirements.txt
2344156

Arvind Sreenivas commited on

fix: remove pywin32 Windows-only packages, use Python 3.13
1091939

Arvind Sreenivas commited on

fix: bump to Python 3.13 to match requirements.txt (audioop-lts)
5482efa

Arvind Sreenivas commited on

fix: remove scripts/ COPY, scripts dir not in repo
dd1dcec

Arvind Sreenivas commited on