Feature Extraction
Transformers
gpt2
tokenizer
embeddings
unicode