NickupAI/alphabypass3

Reinforcement Learning

censorship-circumvention

NickupAI committed 21 days ago

Commit ec51225 · verified · 1 Parent(s): b483a85

Update README.md

Files changed (1)

README.md +1 -1

@@ -38,7 +38,7 @@ Instead of manually tuning parameters, a neural network figures it out by trial
 | Algorithm | PPO (Proximal Policy Optimization) |
 | Action space | Mixed discrete + continuous |
 | Observation space | 75-dimensional vector |
-| Training episodes | ~1,100 |
+| Training episodes (basically VPNs tried) | ~1,100 |
 | Target protocol | VLESS + REALITY (xray-core) |
 | Success rate | **93%** |
 | Avg reward | +0.81 (scale: −1.0 to +1.0) |