gradio sentence-transformers PyMuPDF scikit-learn nltk pandas