Instructions to use 2045max/finrl-ppo-dow30-quick with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- stable-baselines3
How to use 2045max/finrl-ppo-dow30-quick with stable-baselines3:
from huggingface_sb3 import load_from_hub checkpoint = load_from_hub( repo_id="2045max/finrl-ppo-dow30-quick", filename="{MODEL FILENAME}.zip", ) - Notebooks
- Google Colab
- Kaggle
| library_name: stable-baselines3 | |
| tags: | |
| - reinforcement-learning | |
| - finrl | |
| - ppo | |
| - stock-trading | |
| # FinRL PPO Agent (Quick Demo, 2000 steps) | |
| Trained on DOW 30 stocks (2014-2025) using FinRL + Stable-Baselines3 PPO. | |
| ⚠️ **Toy model — only 2000 timesteps**, used to validate training pipeline. | |
| Not for real trading. | |
| ## Usage | |
| ```python | |
| from huggingface_hub import hf_hub_download | |
| from stable_baselines3 import PPO | |
| path = hf_hub_download( | |
| repo_id="2045max/finrl-ppo-dow30-quick", | |
| filename="agent_ppo.zip" | |
| ) | |
| model = PPO.load(path) | |
| ``` | |
| ## Training Setup | |
| - Algorithm: PPO (Proximal Policy Optimization) | |
| - Total timesteps: 2,000 | |
| - State space: 301 (cash + 30 prices + 30 holdings + 30×8 indicators) | |
| - Action space: 30 (continuous, [-1, 1] per stock) | |
| - Reward: portfolio value change × 1e-4 | |
| ## Source | |
| https://github.com/AI4Finance-Foundation/FinRL | |