Update README.md
README.md
CHANGED
@@ -12,7 +12,7 @@ pipeline_tag: other
 
 # FlowR2A: Learning Reward-to-Action Distribution for Multimodal Driving Planning
 
-FlowR2A is a generative multimodal driving planner that learns the **reward-conditioned action distribution**
+FlowR2A is a generative multimodal driving planner that learns the **reward-conditioned action distribution** p(a|r) with **flow matching**. Instead of treating simulation-based rewards as *discriminative targets* (as in scoring-based planners), FlowR2A reframes them as *generative conditions*, unifying the dense supervision of scoring-based methods with the dynamic proposal generation of anchor-based methods in a single model. This forces the planner to internalize how an action relates to its outcomes in safety, progress, comfort, and rule compliance.
 
 - 📄 **Paper:** FlowR2A: Learning Reward-to-Action Distribution for Multimodal Driving Planning
 - 🌐 **Project page:** https://lixirui142.github.io/flowr2a-project-page/
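
The updated description says the planner learns p(a|r) with flow matching, using the reward as a generative condition. The following is a minimal, generic sketch of what a reward-conditioned flow matching objective and sampler can look like. It is not taken from the FlowR2A code: the linear noise-to-action path, the Euler sampler, the flat list representation of an action, and all names (`cfm_training_pair`, `cfm_loss`, `sample_action`, `toy_velocity`) are assumptions made for illustration.

```python
import random


def cfm_training_pair(action, rng):
    """Build one flow matching training sample for a target action.

    Assumes a linear path x_t = (1 - t) * x0 + t * action with Gaussian x0,
    whose target velocity is (action - x0).
    """
    t = rng.random()  # t in [0, 1)
    x0 = [rng.gauss(0.0, 1.0) for _ in action]
    x_t = [(1.0 - t) * n + t * a for n, a in zip(x0, action)]
    target_v = [a - n for n, a in zip(x0, action)]
    return t, x_t, target_v


def cfm_loss(velocity_fn, action, reward, rng):
    """Mean squared error between predicted and target velocity.

    velocity_fn(x_t, t, reward) stands in for the learned network; the reward
    enters only as a condition, never as a regression target.
    """
    t, x_t, target_v = cfm_training_pair(action, rng)
    pred_v = velocity_fn(x_t, t, reward)
    return sum((p - q) ** 2 for p, q in zip(pred_v, target_v)) / len(action)


def sample_action(velocity_fn, reward, dim, rng, steps=20):
    """Draw one action for a given reward by Euler-integrating dx/dt = v(x, t, r)."""
    x = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    dt = 1.0 / steps
    for i in range(steps):
        v = velocity_fn(x, i * dt, reward)
        x = [xi + dt * vi for xi, vi in zip(x, v)]
    return x


def toy_velocity(x, t, reward):
    """Hypothetical stand-in for a trained model: pretends each reward maps to
    the single action [reward, -reward] and returns the exact velocity for it."""
    target = [reward, -reward]
    return [(a - xi) / (1.0 - t) for a, xi in zip(target, x)]


if __name__ == "__main__":
    rng = random.Random(0)
    print("loss:", cfm_loss(toy_velocity, [0.5, -0.5], 0.5, rng))
    print("sample:", sample_action(toy_velocity, 0.5, 2, rng))
```

In a real planner the toy velocity function would be a neural network, an action would be a trajectory, and each reward would correspond to a distribution over actions rather than a single point; drawing several samples under the same reward is what would give multimodal proposals.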