Running Featured 1.34k FineWeb: decanting the web for the finest text data at scale 🍷 Explore and download the FineWeb web-text dataset
Paused Featured 133 Pdf To Structured Data PDF to Structured Data powered by Google DeepMind Gemini 2.0
distilbert/distilbert-base-uncased-finetuned-sst-2-english Text Classification • 67M • Updated Dec 19, 2023 • 3.47M • 894
sentence-transformers/paraphrase-multilingual-mpnet-base-v2 Sentence Similarity • 0.3B • Updated Aug 19, 2025 • 5.81M • 460
sentence-transformers/all-MiniLM-L6-v2 Sentence Similarity • 22.7M • Updated Mar 6, 2025 • 249M • 4.77k
deepset/minilm-uncased-squad2 Question Answering • 33.4M • Updated Sep 26, 2024 • 55.6k • 47