Spaces:
Runtime error
Runtime error
Commit Β·
022be04
1
Parent(s): b91005f
Fix division by zero in final avg improvement calculation
Browse files- training/eval_report.py +1 -1
training/eval_report.py
CHANGED
|
@@ -178,7 +178,7 @@ def generate_report(before: dict, after: dict, output_dir: pathlib.Path) -> str:
|
|
| 178 |
| **Avg Reward** | {before["overall_avg_reward"]:.4f} | {after["overall_avg_reward"]:.4f} | **{overall_improvement:+.2f}%** β
|
|
| 179 |
| **Max Reward** | {before["max_reward"]:.4f} | {after["max_reward"]:.4f} | +{after["max_reward"] - before["max_reward"]:.4f} |
|
| 180 |
| **Episodes** | {before.get("total_episodes", len(before["episodes"]))} | {after["total_episodes"]} | β |
|
| 181 |
-
| **Final 10 Avg** | {before["overall_avg_reward"]:.4f} | {after["final_avg"]:.4f} | **{((after["final_avg"] - before["overall_avg_reward"]) / abs(before["overall_avg_reward"]) * 100):+.2f}%** |
|
| 182 |
|
| 183 |
### π Per-Task Breakdown
|
| 184 |
|
|
|
|
| 178 |
| **Avg Reward** | {before["overall_avg_reward"]:.4f} | {after["overall_avg_reward"]:.4f} | **{overall_improvement:+.2f}%** β
|
|
| 179 |
| **Max Reward** | {before["max_reward"]:.4f} | {after["max_reward"]:.4f} | +{after["max_reward"] - before["max_reward"]:.4f} |
|
| 180 |
| **Episodes** | {before.get("total_episodes", len(before["episodes"]))} | {after["total_episodes"]} | β |
|
| 181 |
+
| **Final 10 Avg** | {before["overall_avg_reward"]:.4f} | {after["final_avg"]:.4f} | **{(((after["final_avg"] - before["overall_avg_reward"]) / abs(before["overall_avg_reward"]) * 100) if before["overall_avg_reward"] != 0 else 0):+.2f}%** |
|
| 182 |
|
| 183 |
### π Per-Task Breakdown
|
| 184 |
|