transformers>=4.44.0
torch>=2.2
accelerate>=0.30.0
safetensors>=0.4.2
pillow>=10.3
torchvision
facenet-pytorch
numpy
scenedetect
opencv-python
wordfreq
easyocr
pydantic==2.10.6