Duplicated from sail/zero-bubble-pipeline-parallellism
Check out our paper at Arxiv.
Bubble Rate here is calculated as (1 - longest stage time/(F+B+W)/m).