Update README.md
README.md
CHANGED
@@ -14,7 +14,7 @@ pipeline_tag: other
 
 FlowR2A is a generative multimodal driving planner that learns the **reward-conditioned action distribution** p(a|r) with **flow matching**. Instead of treating simulation-based rewards as *discriminative targets* (as in scoring-based planners), FlowR2A reframes them as *generative conditions*, unifying the dense supervision of scoring-based methods with the dynamic proposal generation of anchor-based methods in a single model. This forces the planner to internalize how an action relates to its outcomes in safety, progress, comfort, and rule compliance.
 
-- **Paper:**
+- **Paper:** https://arxiv.org/abs/2606.24231
 - **Project page:** https://lixirui142.github.io/flowr2a-project-page/
 - **Code:** https://github.com/lixirui142/FlowR2A
 