Found a critical bug in LSTM inference: distillation uses the BERT tokenizer, but inference uses the custom LSTM tokenizer 77bc910 jesse-tong committed on Mar 31, 2025
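
The kind of mismatch this commit message describes can be illustrated with a minimal sketch. It assumes the bug comes from the two tokenizers assigning different ids to the same tokens, which the message does not state in detail. The vocabularies and the helper names `encode` and `same_token_ids` are hypothetical and are not taken from this repository.

```python
# Toy illustration only: both vocabularies are hypothetical stand-ins,
# not the real BERT or custom LSTM vocabularies from this repository.
TRAIN_VOCAB = {"[PAD]": 0, "[UNK]": 1, "good": 2, "movie": 3}  # distillation side
INFER_VOCAB = {"<pad>": 0, "<unk>": 1, "movie": 2, "good": 3}  # inference side


def encode(text, vocab, unk_token):
    """Whitespace-tokenize text and map each token to its id in vocab."""
    unk_id = vocab[unk_token]
    return [vocab.get(tok, unk_id) for tok in text.lower().split()]


def same_token_ids(text, vocab_a, unk_a, vocab_b, unk_b):
    """Return True when both vocabularies encode text to identical ids."""
    return encode(text, vocab_a, unk_a) == encode(text, vocab_b, unk_b)


train_ids = encode("good movie", TRAIN_VOCAB, "[UNK]")
infer_ids = encode("good movie", INFER_VOCAB, "<unk>")

# The same sentence maps to different ids, so an embedding table trained on
# the first id space would look up the wrong rows for the second.
print(train_ids, infer_ids)
print(same_token_ids("good movie", TRAIN_VOCAB, "[UNK]", INFER_VOCAB, "<unk>"))
```

A check like `same_token_ids` on a few sample sentences before inference is one possible guard against this class of bug; sharing a single tokenizer between distillation and inference avoids it entirely.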