Abrupt summaries
#2
by
epartalidou
- opened
Hello! I am attempting to use the model in a dataset with long sequences (>10K tokens), but the output in many cases has no end and just reaches the MAX_NEW_TOKENS, instead of choosing EOS. Any thoughts on why this is happening?