Update research title to: Concentrate or Collapse
Browse files
README.md
CHANGED
|
@@ -14,7 +14,7 @@ ReFusion 8B trained with ESPO v19 (ELBO-based Sequence-level Policy Optimization
|
|
| 14 |
|
| 15 |
## Paper
|
| 16 |
|
| 17 |
-
**
|
| 18 |
- Author: Muhammad Enrizky Brillian
|
| 19 |
- Institution: University of Toronto Scarborough
|
| 20 |
|
|
@@ -30,7 +30,7 @@ If you use this model, please cite:
|
|
| 30 |
|
| 31 |
```bibtex
|
| 32 |
@article{brillian2026flowgrpo,
|
| 33 |
-
title={
|
| 34 |
author={Brillian, Muhammad Enrizky},
|
| 35 |
year={2026}
|
| 36 |
}
|
|
|
|
| 14 |
|
| 15 |
## Paper
|
| 16 |
|
| 17 |
+
**Concentrate or Collapse: When Reinforcement Learning Meets Diffusion Language Models for Web Planning**
|
| 18 |
- Author: Muhammad Enrizky Brillian
|
| 19 |
- Institution: University of Toronto Scarborough
|
| 20 |
|
|
|
|
| 30 |
|
| 31 |
```bibtex
|
| 32 |
@article{brillian2026flowgrpo,
|
| 33 |
+
title={Concentrate or Collapse: When Reinforcement Learning Meets Diffusion Language Models for Web Planning},
|
| 34 |
author={Brillian, Muhammad Enrizky},
|
| 35 |
year={2026}
|
| 36 |
}
|