How's T5 handling longer sequences?

#14

by kelvinspire - opened Mar 10, 2023

Discussion

kelvinspire

Mar 10, 2023

I'm curious to know how T5 is handling longer sequences behind the scenes, does it chunk the inputs? Any ideas?

juusohugs

Mar 22, 2023

Don't trust on me on this, but I think it can handle up to 512 tokens (and truncates after that) but unfortunately cannot remember where I saw this information and cannot say if this is correct :) 512 tokens is roughly 400 words.

kelvinspire

Mar 23, 2023

@juusohugs I will leave this https://github.com/google-research/FLAN/issues/36#issuecomment-1472282261 here. This thread has the answer to this question.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment