Space: ademarteau/RL-Inventory-Simulations


RL-Inventory-Simulations / agent

53.3 kB

9 contributors

History: 15 commits

Latest commit: RishbhaJain, "refactor: remove Unsloth, use standard transformers + PEFT" (355b2d5, 4 months ago)

__init__.py (0 Bytes): Add three-agent system: Claude LLM, PPO RL, and GRPO fine-tuned Qwen, 4 months ago
finetune_agent.py (11.7 kB): refactor: remove Unsloth, use standard transformers + PEFT, 4 months ago
llm_agent.py (10.5 kB): Add P&L reward function, daily spoilage, stochastic lead time, and reward visualization, 4 months ago
train_grpo.py (31.1 kB): refactor: remove Unsloth, use standard transformers + PEFT, 4 months ago