Update README.md
README.md
CHANGED
@@ -38,7 +38,7 @@ Instead of manually tuning parameters, a neural network figures it out by trial
 | Algorithm | PPO (Proximal Policy Optimization) |
 | Action space | Mixed discrete + continuous |
 | Observation space | 75-dimensional vector |
-| Training episodes | ~1,100 |
+| Training episodes (basically VPNs tried) | ~1,100 |
 | Target protocol | VLESS + REALITY (xray-core) |
 | Success rate | **93%** |
 | Avg reward | +0.81 (scale: −1.0 to +1.0) |