lixirui142 commited on
Commit
1f6fc59
ยท
verified ยท
1 Parent(s): 072ebc5

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -14,7 +14,7 @@ pipeline_tag: other
14
 
15
  FlowR2A is a generative multimodal driving planner that learns the **reward-conditioned action distribution** p(a|r) with **flow matching**. Instead of treating simulation-based rewards as *discriminative targets* (as in scoring-based planners), FlowR2A reframes them as *generative conditions*, unifying the dense supervision of scoring-based methods with the dynamic proposal generation of anchor-based methods in a single model. This forces the planner to internalize how an action relates to its outcomes in safety, progress, comfort, and rule compliance.
16
 
17
- - ๐Ÿ“„ **Paper:** FlowR2A: Learning Reward-to-Action Distribution for Multimodal Driving Planning
18
  - ๐ŸŒ **Project page:** https://lixirui142.github.io/flowr2a-project-page/
19
  - ๐Ÿ’ป **Code:** https://github.com/lixirui142/FlowR2A
20
 
 
14
 
15
  FlowR2A is a generative multimodal driving planner that learns the **reward-conditioned action distribution** p(a|r) with **flow matching**. Instead of treating simulation-based rewards as *discriminative targets* (as in scoring-based planners), FlowR2A reframes them as *generative conditions*, unifying the dense supervision of scoring-based methods with the dynamic proposal generation of anchor-based methods in a single model. This forces the planner to internalize how an action relates to its outcomes in safety, progress, comfort, and rule compliance.
16
 
17
+ - ๐Ÿ“„ **Paper:** https://arxiv.org/abs/2606.24231
18
  - ๐ŸŒ **Project page:** https://lixirui142.github.io/flowr2a-project-page/
19
  - ๐Ÿ’ป **Code:** https://github.com/lixirui142/FlowR2A
20