JonusNattapong
/

Reinforcement-Learning-for-Gold-Trading-Model

+---
+license: mit
+language: en
+library_name: stable-baselines3
+tags:
+- reinforcement-learning
+- finance
+- gold-trading
+- xauusd
+- ppo
+metrics:
+- sharpe_ratio
+- win_rate
+pipeline_tag: reinforcement-learning
+---
+# PPO Model for XAUUSD Gold Trading
+This repository contains a Reinforcement Learning model trained using Proximal Policy Optimization (PPO) for trading XAUUSD (Gold vs US Dollar) on 15-minute timeframes.
+## Model Details
+- **Model Type**: PPO (Proximal Policy Optimization)
+- **Framework**: Stable-Baselines3
+- **Environment**: Custom Gym environment for XAUUSD trading
+- **Training Data**: Historical XAUUSD data from 2004 to 2025 (resampled to 15-min bars)
+- **Total Timesteps**: 1,000,000
+- **Position Sizing**: Base 5.0 oz, Max 7.5 oz
+- **Initial Capital**: 200 USD
+- **Transaction Cost**: 0.65 USD per oz
+## Performance Metrics (Test Set)
+- **Average Daily Profit**: 51.46 USD
+- **Win Rate**: 69.0%
+- **Max Drawdown**: 12.0%
+- **Sharpe Ratio**: 7.56
+- **Average Trades per Day**: 2.66
+## Features Used
+- Log Return
+- RSI (14-period)
+- Moving Averages (short/long)
+- Bollinger Bands
+- MACD
+- Volume indicators
+## Usage
+### Loading the Model
+```python
+from safetensors.torch import load_file
+from stable_baselines3 import PPO
+import torch
+# Load state dict from safetensors
+state_dict = load_file("ppo_xauusd.safetensors")
+policy = PPO.policy_class(observation_space, action_space)  # Define spaces accordingly
+policy.load_state_dict(state_dict)
+# Create model
+model = PPO(policy=policy, env=env)  # Or load full model if available
+```
+### For Full Inference
+To use the model for trading, you'll need to:
+1. Set up the trading environment (`XAUUSDTradingEnv`)
+2. Load VecNormalize stats
+3. Run predictions
+Note: This is a simulation model. Use with caution in real trading.
+## Training Configuration
+- Learning Rate: 0.0003
+- Batch Size: 256
+- Gamma: 0.99
+- GAE Lambda: 0.95
+- Clip Range: 0.2
+- Entropy Coefficient: 0.01
+## Files
+- `ppo_xauusd.safetensors`: Model weights in SafeTensors format
+- `vecnormalize.pkl`: VecNormalize statistics for observation normalization
+## License
+MIT License
+## Disclaimer
+This model is for educational and research purposes only. Trading involves risk, and past performance does not guarantee future results. Always backtest and validate before using in live trading.