m2mtokenizer doesn't know the word "wouldn't"
#2
by anzorq - opened
I accidentally discovered that the tokenizer tokenizes the word "wouldn't" as ['<unk>', "'", 't'].
It doesn't seem to affect the model's performance, but it makes me wonder what else is missing from the tokenizer's vocabulary.
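For anyone who wants to check other words, here is a rough sketch of how one might scan a word list for unknown tokens. The helper names `find_unk_words` and `check_m2m100` are made up for illustration and are not part of Transformers; the sketch only assumes the standard tokenizer methods `encode`, `convert_ids_to_tokens`, and the `unk_token_id` attribute. It compares token ids rather than token strings, on the assumption that this is the more reliable way to spot `<unk>`.

```python
from typing import Dict, Iterable, List


def find_unk_words(tokenizer, words: Iterable[str]) -> Dict[str, List[str]]:
    """Return {word: tokens} for each word whose encoding contains the unknown-token id.

    `tokenizer` is assumed to expose `encode`, `convert_ids_to_tokens`,
    and `unk_token_id`, as Hugging Face tokenizers do.
    """
    flagged = {}
    for word in words:
        # Skip special tokens (language tag, EOS) so only the word's own pieces are checked.
        ids = tokenizer.encode(word, add_special_tokens=False)
        if tokenizer.unk_token_id in ids:
            flagged[word] = tokenizer.convert_ids_to_tokens(ids)
    return flagged


def check_m2m100(words: Iterable[str], model_name: str = "facebook/m2m100_1.2B"):
    """Hypothetical convenience wrapper: load the tokenizer and scan `words`."""
    # Imported lazily so find_unk_words stays usable without transformers installed.
    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    return find_unk_words(tokenizer, words)
```

Calling `check_m2m100(["wouldn't", "couldn't", "shouldn't"])` would download the tokenizer and return only the words that hit the unknown token, together with their token lists. What it reports may depend on the installed Transformers and SentencePiece versions.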