Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

Spaces:
ademarteau
/
RL-Inventory-Simulations
Runtime error

App Files Files Community
Fetching metadata from the HF Docker repository...
RL-Inventory-Simulations / agent
53.3 kB
Ctrl+K
Ctrl+K
  • 9 contributors
History: 15 commits
RishbhaJain
refactor: remove Unsloth, use standard transformers + PEFT
355b2d5 2 months ago
  • __init__.py
    0 Bytes
    Add three-agent system: Claude LLM, PPO RL, and GRPO fine-tuned Qwen 2 months ago
  • finetune_agent.py
    11.7 kB
    refactor: remove Unsloth, use standard transformers + PEFT 2 months ago
  • llm_agent.py
    10.5 kB
    Add P&L reward function, daily spoilage, stochastic lead time, and reward visualization 2 months ago
  • train_grpo.py
    31.1 kB
    refactor: remove Unsloth, use standard transformers + PEFT 2 months ago