Instructions to use google/flan-t5-large with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use google/flan-t5-large with Transformers:
# Load model directly from transformers import AutoTokenizer, AutoModelForSeq2SeqLM tokenizer = AutoTokenizer.from_pretrained("google/flan-t5-large") model = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-large") - Notebooks
- Google Colab
- Kaggle
How's T5 handling longer sequences?
#14
by kelvinspire - opened
I'm curious to know how T5 is handling longer sequences behind the scenes, does it chunk the inputs? Any ideas?
Don't trust on me on this, but I think it can handle up to 512 tokens (and truncates after that) but unfortunately cannot remember where I saw this information and cannot say if this is correct :) 512 tokens is roughly 400 words.
@juusohugs I will leave this https://github.com/google-research/FLAN/issues/36#issuecomment-1472282261 here. This thread has the answer to this question.