Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
thangtm 's Collections
robot
code LLM
data
flow_matching_model
reasoning_model
DLM
RL
ARC
RAG
Reduce_thinking
OCR

data

updated about 1 hour ago
Upvote
-

  • DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

    Paper • 2512.16676 • Published 29 days ago • 207

  • Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels

    Paper • 2510.06499 • Published Oct 7, 2025 • 31

  • FLAMES: Improving LLM Math Reasoning via a Fine-Grained Analysis of the Data Synthesis Pipeline

    Paper • 2508.16514 • Published Aug 22, 2025 • 1

  • Seed-Coder: Let the Code Model Curate Data for Itself

    Paper • 2506.03524 • Published Jun 4, 2025 • 6
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs