raghavendram/supply-chain-env

Reinforcement Learning

curriculum-learning

Commit History

fix: clamp ALL rewards/scores to strict (0.01, 0.99) — every output path

29994af

ragavrida committed on Apr 11

fix: clamp all scores to strict (0,1) — never 0.0 or 1.0

520454b

ragavrida committed on Apr 11

fix: match EXACT sample inference pattern — HF_TOKEN or API_KEY, getenv with fallbacks

eef2674

ragavrida committed on Apr 11

fix: test LLM proxy FIRST before env connection — ensures API call is made

393727e

ragavrida committed on Apr 11

fix: use exactly os.environ[API_BASE_URL] and os.environ[API_KEY] — no fallbacks

470383e

ragavrida committed on Apr 11

fix: bulletproof inference — never crash, always emit START/END, defensive parsing

4c49198

ragavrida committed on Apr 11

fix: add error handling for from_docker_image + full traceback logging

723c4a5

ragavrida committed on Apr 11

fix: match reference inference pattern — HF_TOKEN, from_docker_image, no fallback URL

a4e3468

ragavrida committed on Apr 11

fix: use only platform API_BASE_URL and API_KEY, no fallbacks

2538ea3

ragavrida committed on Apr 11

add uv.lock for openenv validate

428321a

ragavrida committed on Apr 11

feat: 3 tasks with programmatic graders + OPENAI_API_KEY support

af0f6eb

ragavrida committed on Apr 11

docs: rewrite README — adaptive curriculum front and center

45e50e5

ragavrida committed on Apr 11

feat: adaptive curriculum — environment learns from agent and gets harder

c63ea5a

ragavrida committed on Apr 11

feat: real data wired in, visual map, all 5 improvements complete

5e5efc0

ragavrida committed on Apr 11

feat: real data, Gymnasium wrapper, baseline comparison, research framing

67e22e7

ragavrida committed on Apr 11

feat: add live demo and baseline agent

25c27c9

ragavrida committed on Apr 11

feat: SupplyChainEnv — global supply chain disruption RL environment

af6c6b1

ragavrida, Claude Opus 4.6 (1M context) committed on Apr 11

initial commit

f576258
verified

ragavrida committed on Apr 11