Translation
Transformers
PyTorch
TensorFlow
JAX
Rust
Safetensors
t5
text2text-generation
summarization
text-generation-inference
Instructions to use google-t5/t5-base with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use google-t5/t5-base with Transformers:
# Use a pipeline as a high-level helper # Warning: Pipeline type "translation" is no longer supported in transformers v5. # You must load the model directly (see below) or downgrade to v4.x with: # 'pip install "transformers<5.0.0' from transformers import pipeline pipe = pipeline("translation", model="google-t5/t5-base")# Load model directly from transformers import AutoTokenizer, AutoModelForSeq2SeqLM tokenizer = AutoTokenizer.from_pretrained("google-t5/t5-base") model = AutoModelForSeq2SeqLM.from_pretrained("google-t5/t5-base") - Inference
- Notebooks
- Google Colab
- Kaggle
Adds the tokenizer configuration file
#25
by lysandre HF Staff - opened
The tokenizer configuration file is missing/incorrect and therefore leading to unforeseen errors after the migration of the canonical models.
Refer to the following issue for more information: transformers#29050
The current failing code is the following:
from transformers import AutoTokenizer
>>> previous_tokenizer = AutoTokenizer.from_pretrained("t5-base")
>>> current_tokenizer = AutoTokenizer.from_pretrained("google-t5/t5-base")
>>> print(previous_tokenizer.model_max_length, current_tokenizer.model_max_length)
1000000000000000019884624838656, 512
This is the result after the fix:
from transformers import AutoTokenizer
>>> previous_tokenizer = AutoTokenizer.from_pretrained("t5-base")
>>> current_tokenizer = AutoTokenizer.from_pretrained("google-t5/t5-base")
>>> print(previous_tokenizer.model_max_length, current_tokenizer.model_max_length)
512, 512