gradio bitsandbytes sentencepiece protobuf scipy opencv-python moviepy==1.0.3 numpy imageio imageio-ffmpeg requests torchvision google openai google-genai qwen-omni-utils soundfile torch transformers==4.52.3 accelerate ms-swift