Running Featured 1.31k FineWeb: decanting the web for the finest text data at scale 🍷 1.31k Generate a curated web‑text dataset for LLM training
sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 Sentence Similarity • 0.1B • Updated Jan 28 • 23.7M • • 1.16k
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! +10 Aug 5, 2025 • 511