transformers
datasets
torch
torchvision
pillow
pytesseract