laner ten's picture

2 7

laner ten

that113

·

AI & ML interests

None yet

Organizations

None yet

liked 2 Spaces 7 months ago

FineWeb: decanting the web for the finest text data at scale

Explore and download the FineWeb web‑text dataset

The Smol Training Playbook

The secrets to building world-class LLMs

liked a dataset 11 months ago

llamafactory/DPO-En-Zh-20k

Viewer • Updated Jun 7, 2024 • 20k • 515 • 102

liked 2 models about 1 year ago

microsoft/Magma-8B

Robotics • 9B • Updated Dec 10, 2025 • 476 • 415

deepseek-ai/DeepSeek-Prover-V2-671B

Text Generation • 685B • Updated Apr 30, 2025 • 700 • • 829

liked a Space over 1 year ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

liked a dataset over 1 year ago

AI-MO/NuminaMath-CoT

Viewer • Updated Nov 25, 2024 • 860k • 61.6k • 579