Can this run on FLAN t5?
#5
by ljhwild - opened
I'm just reading the paper, and it appears long-t5 is built on t5 and not on flan-t5.
Is there any reason why?
Hello! Both t5 and flan-t5 have the same model architecture. You can see in flan-t5's config.json that it uses the t5 architecture under the hood: https://huggingface.co/google/flan-t5-xxl/blob/main/config.json#L3
However, long-t5 has a slightly different architecture: its encoder replaces full self-attention with local or transient-global attention, which lets it scale to longer sequences.
Hope that helps!
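To give a rough sense of why the architecture change matters for long inputs, here is a minimal sketch that counts encoder attention pairs for three schemes. The function names are hypothetical, and the defaults (a local radius of 127 and a global block size of 16) are assumed from the values described in the LongT5 paper. The counts are upper bounds that ignore edge effects, not measurements of the actual models.

```python
def full_attention_pairs(n: int) -> int:
    """Full self-attention (plain T5 encoder): every token attends to every token."""
    return n * n


def local_attention_pairs(n: int, radius: int = 127) -> int:
    """Local attention: each token sees at most `radius` neighbours on each side."""
    window = min(n, 2 * radius + 1)
    return n * window


def tglobal_attention_pairs(n: int, radius: int = 127, block: int = 16) -> int:
    """Transient-global attention: the local window plus one summary token per block."""
    window = min(n, 2 * radius + 1)
    global_tokens = -(-n // block)  # ceiling division: number of blocks
    return n * (window + global_tokens)


if __name__ == "__main__":
    # Assumed sequence lengths, chosen only for illustration.
    for n in (512, 4096, 16384):
        print(
            n,
            full_attention_pairs(n),
            local_attention_pairs(n),
            tglobal_attention_pairs(n),
        )
```

Under these assumptions the full-attention count grows quadratically with sequence length, while the local count grows linearly and the transient-global count grows much more slowly than full attention. That difference is why the T5 and Flan-T5 checkpoints, which use full attention in the encoder, cannot simply be swapped in for long-t5.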