Has anyone achieved a speed-up with this model?
#3 opened about 1 year ago by RonanMcGovern
Add text-generation pipeline tag and MIT license
#2 opened about 1 year ago by nielsr
Is this MTP head just for predicting one token ahead?
#1 opened about 1 year ago by RonanMcGovern