swarm-ppo-drone / README.md
lia21's picture
Upload PPO drone policy + logs
bae773a
metadata
tags:
  - ppo
  - reinforcement-learning
  - swarm
  - drone
  - bittensor
license: mit
language:
  - en
library_name: stable-baselines3

πŸš€ Swarm PPO Drone

This repository contains a Proximal Policy Optimization (PPO) model trained for swarm/drone control.
The model was trained using Gymnasium environments with Stable-Baselines3 and exported for use in Bittensor Subnet 124 (Swarm).


πŸ“‚ Files

  • policy.pth – Trained PPO policy weights (PyTorch).
  • ppo_policy.zip – Stable-Baselines3 PPO saved model (reload with PPO.load()).
  • safe_policy_meta.json – Metadata for policy compliance.
  • best/ – Best checkpointed model during training.
  • eval_logs/ – Evaluation logs.
  • tb_logs/ – TensorBoard logs.

πŸ› οΈ Usage

Load with Stable-Baselines3

from stable_baselines3 import PPO
import gymnasium as gym

# Load model
model = PPO.load("ppo_policy.zip")

# Example run
env = gym.make("CartPole-v1")
obs, _ = env.reset()
action, _ = model.predict(obs)
print("Predicted action:", action)