Instructions to use vocab-transformers/distilbert-tokenizer_256k-MLM_1M with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use vocab-transformers/distilbert-tokenizer_256k-MLM_1M with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("fill-mask", model="vocab-transformers/distilbert-tokenizer_256k-MLM_1M")# Load model directly from transformers import AutoTokenizer, AutoModelForMaskedLM tokenizer = AutoTokenizer.from_pretrained("vocab-transformers/distilbert-tokenizer_256k-MLM_1M") model = AutoModelForMaskedLM.from_pretrained("vocab-transformers/distilbert-tokenizer_256k-MLM_1M") - Notebooks
- Google Colab
- Kaggle
YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
DistilBERT with 256k token embeddings
This model was initialized with a word2vec token embedding matrix with 256k entries, but these token embeddings were updated during MLM. The word2vec was trained on 100GB data from C4, MSMARCO, News, Wikipedia, S2ORC, for 3 epochs.
Then the model was trained on this dataset with MLM for 1M steps (batch size 64). The token embeddings were updated during MLM.
- Downloads last month
- 10