---
title: CodeReview Training
emoji: 🤖
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
---

# CodeReview PPO Training

This Space trains an LLM agent to fix injected bugs using PPO and rubrics.
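
To make the idea concrete, here is a minimal sketch of how a rubric score could feed a PPO update. It is an illustration only: the Space's actual rubric criteria and training code are not shown in this README, so the function names (`rubric_reward`, `ppo_clipped_objective`), the criterion names, and the weights are all hypothetical. The clipped surrogate is the standard PPO objective for a single sample.

```python
import math


def rubric_reward(scores, weights):
    """Weighted average of per-criterion rubric scores, each in [0, 1].

    Both dicts are hypothetical; the real rubric may use other criteria.
    """
    total_weight = sum(weights.values())
    return sum(weights[name] * scores[name] for name in weights) / total_weight


def ppo_clipped_objective(logp_new, logp_old, advantage, clip_eps=0.2):
    """Standard PPO clipped surrogate for one sample (to be maximized)."""
    # Probability ratio between the updated policy and the sampling policy.
    ratio = math.exp(logp_new - logp_old)
    # Restrict the ratio to [1 - clip_eps, 1 + clip_eps].
    clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
    # Taking the minimum gives a pessimistic bound on the improvement.
    return min(ratio * advantage, clipped * advantage)


# Hypothetical rubric: did the fix pass the tests, and was the diff small?
reward = rubric_reward(
    scores={"tests_pass": 1.0, "minimal_diff": 0.5},
    weights={"tests_pass": 3.0, "minimal_diff": 1.0},
)
```

In this sketch the rubric reward would serve as the basis for the advantage estimate, and the clipping keeps a single high-reward fix from moving the policy too far in one step.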