Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

thegoodfellas
/

tgf-bpe-tokenizer

Model card Files Files and versions

Instructions to use thegoodfellas/tgf-bpe-tokenizer with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use thegoodfellas/tgf-bpe-tokenizer with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("fill-mask", model="thegoodfellas/tgf-bpe-tokenizer")

# Load model directly
from transformers import AutoTokenizer, AutoModelForMaskedLM

tokenizer = AutoTokenizer.from_pretrained("thegoodfellas/tgf-bpe-tokenizer")
model = AutoModelForMaskedLM.from_pretrained("thegoodfellas/tgf-bpe-tokenizer")

Notebooks
Google Colab
Kaggle

tgf-bpe-tokenizer

2.89 MB

Ctrl+K

Ctrl+K

1 contributor

History: 3 commits

rdemorais's picture

Update README.md

9e0fe7a about 3 years ago

.gitattributes

1.48 kB
initial commit about 3 years ago
README.md

42 Bytes
Update README.md about 3 years ago
config.json

646 Bytes
trained from thegoodfellas/mc4-pt-cleaned about 3 years ago
merges.txt

414 kB
trained from thegoodfellas/mc4-pt-cleaned about 3 years ago
tokenizer.json

1.79 MB
trained from thegoodfellas/mc4-pt-cleaned about 3 years ago
vocab.json

688 kB
trained from thegoodfellas/mc4-pt-cleaned about 3 years ago