Abdalkaderdev's picture
Reduce tokens for faster CPU inference
d2505af