
Yuxin Chen

Uasonchen

7


AI & ML interests

None yet

Organizations

Uasonchen's collections (9)

Video Understanding

lmms-lab/LLaVA-OneVision-Data

Viewer • Updated May 24, 2025 • 3.94M • 20.8k • 238
lmms-lab/LLaVA-Video-178K

Viewer • Updated Oct 11, 2024 • 1.63M • 25.5k • 197

Image Generation

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published Aug 14, 2025 • 146

Vision Foundation Model

DINOv3

Paper • 2508.10104 • Published Aug 13, 2025 • 311

Apriel-1.5-15b-Thinker

Paper • 2510.01141 • Published Oct 1, 2025 • 125
MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

Paper • 2509.21268 • Published Sep 25, 2025 • 104
LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31, 2025 • 85
Visual Representation Alignment for Multimodal Large Language Models

Paper • 2509.07979 • Published Sep 9, 2025 • 84

Math Data Synthesis

Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis from Scratch

Paper • 2410.18693 • Published Oct 24, 2024 • 42

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6, 2025 • 129

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4, 2025 • 276

Video Generation

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

Paper • 2510.02283 • Published Oct 2, 2025 • 98
Paper2Video: Automatic Video Generation from Scientific Papers

Paper • 2510.05096 • Published Oct 6, 2025 • 120
LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26, 2025 • 189
HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning

Paper • 2509.08519 • Published Sep 10, 2025 • 130

Open Math Data for LLM

dyyyyyyyy/ScaleQuest-Math

Viewer • Updated Oct 28, 2024 • 1M • 35 • 23
AI-MO/NuminaMath-CoT

Viewer • Updated Nov 25, 2024 • 860k • 33.1k • 592
