mahithakur committed
Commit 022be04 · 1 Parent(s): b91005f

Fix division by zero in final avg improvement calculation

Files changed (1)
  1. training/eval_report.py +1 -1
training/eval_report.py CHANGED
@@ -178,7 +178,7 @@ def generate_report(before: dict, after: dict, output_dir: pathlib.Path) -> str:
  | **Avg Reward** | {before["overall_avg_reward"]:.4f} | {after["overall_avg_reward"]:.4f} | **{overall_improvement:+.2f}%** ✅ |
  | **Max Reward** | {before["max_reward"]:.4f} | {after["max_reward"]:.4f} | +{after["max_reward"] - before["max_reward"]:.4f} |
  | **Episodes** | {before.get("total_episodes", len(before["episodes"]))} | {after["total_episodes"]} | — |
- | **Final 10 Avg** | {before["overall_avg_reward"]:.4f} | {after["final_avg"]:.4f} | **{((after["final_avg"] - before["overall_avg_reward"]) / abs(before["overall_avg_reward"]) * 100):+.2f}%** |
+ | **Final 10 Avg** | {before["overall_avg_reward"]:.4f} | {after["final_avg"]:.4f} | **{(((after["final_avg"] - before["overall_avg_reward"]) / abs(before["overall_avg_reward"]) * 100) if before["overall_avg_reward"] != 0 else 0):+.2f}%** |
 
  ### 📈 Per-Task Breakdown
 
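
Review note: the guarded expression in the `Final 10 Avg` row could be pulled out into a small helper so the f-string stays readable. The sketch below is only an illustration and is not part of this commit. The name `pct_improvement` and the sample values are hypothetical. It keeps the commit's choice of reporting 0 when the baseline is zero.

```python
def pct_improvement(after: float, before: float) -> float:
    """Percent change of `after` relative to `before`.

    Returns 0.0 when the baseline is zero, mirroring the guard added in
    this commit (hypothetical helper, not present in eval_report.py).
    """
    if before == 0:
        return 0.0
    return (after - before) / abs(before) * 100


# Hypothetical values, for illustration only.
before = {"overall_avg_reward": 0.0}
after = {"final_avg": 0.25}

# Same format spec as the table cell in the diff (+.2f, wrapped in bold).
row = f'**{pct_improvement(after["final_avg"], before["overall_avg_reward"]):+.2f}%**'
print(row)  # **+0.00%**
```

A zero baseline has no well-defined percent change, so printing `+0.00%` is a convention rather than a measurement. A report could instead show a placeholder such as `n/a` for that cell if that reads more honestly.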