Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Spaces:
ademarteau
/
RL-Inventory-Simulations
like
0
Runtime error
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
RL-Inventory-Simulations
/
agent
53.3 kB
Ctrl+K
Ctrl+K
9 contributors
History:
15 commits
RishbhaJain
refactor: remove Unsloth, use standard transformers + PEFT
355b2d5
2 months ago
__init__.py
0 Bytes
Add three-agent system: Claude LLM, PPO RL, and GRPO fine-tuned Qwen
2 months ago
finetune_agent.py
Safe
11.7 kB
refactor: remove Unsloth, use standard transformers + PEFT
2 months ago
llm_agent.py
Safe
10.5 kB
Add P&L reward function, daily spoilage, stochastic lead time, and reward visualization
2 months ago
train_grpo.py
Safe
31.1 kB
refactor: remove Unsloth, use standard transformers + PEFT
2 months ago