hw4_tokenizer_50k / README.md
alexgichamba's picture
Update README.md
b80f853 verified
metadata
library_name: transformers
tags: []

Tokenizer

A tokenizer with a vocab size of 50k for Intro to Deep Learning Homework 4 on Language Modelling and Automatic Speech Recognition.

The tokenizer was trained on LibriSpeech LM text