lixirui142 commited on
Commit
072ebc5
·
verified ·
1 Parent(s): 45262df

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -12,7 +12,7 @@ pipeline_tag: other
12
 
13
  # FlowR2A: Learning Reward-to-Action Distribution for Multimodal Driving Planning
14
 
15
- FlowR2A is a generative multimodal driving planner that learns the **reward-conditioned action distribution** $p(a \mid r)$ with **flow matching**. Instead of treating simulation-based rewards as *discriminative targets* (as in scoring-based planners), FlowR2A reframes them as *generative conditions*, unifying the dense supervision of scoring-based methods with the dynamic proposal generation of anchor-based methods in a single model. This forces the planner to internalize how an action relates to its outcomes in safety, progress, comfort, and rule compliance.
16
 
17
  - 📄 **Paper:** FlowR2A: Learning Reward-to-Action Distribution for Multimodal Driving Planning
18
  - 🌐 **Project page:** https://lixirui142.github.io/flowr2a-project-page/
 
12
 
13
  # FlowR2A: Learning Reward-to-Action Distribution for Multimodal Driving Planning
14
 
15
+ FlowR2A is a generative multimodal driving planner that learns the **reward-conditioned action distribution** p(a|r) with **flow matching**. Instead of treating simulation-based rewards as *discriminative targets* (as in scoring-based planners), FlowR2A reframes them as *generative conditions*, unifying the dense supervision of scoring-based methods with the dynamic proposal generation of anchor-based methods in a single model. This forces the planner to internalize how an action relates to its outcomes in safety, progress, comfort, and rule compliance.
16
 
17
  - 📄 **Paper:** FlowR2A: Learning Reward-to-Action Distribution for Multimodal Driving Planning
18
  - 🌐 **Project page:** https://lixirui142.github.io/flowr2a-project-page/