Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers โข 70 items โข Updated Dec 10, 2025 โข 172
sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 Sentence Similarity โข 0.1B โข Updated Jan 28 โข 49.7M โข โข 1.25k