pkghf/images
Updated
Great Blog. The accuracy is totally dependent on few factors like use of Greedy decoding and high quality of Assistant models.
Also the latency gains is significant with smaller assistant models which creates a tradeoff between its accuracy vs speed