Shuai Wang

Shuaiii

AI & ML interests

None yet

Recent Activity

liked a dataset 7 days ago

longvideobench/LongVideoBench

upvoted a paper 8 days ago

Variable-Width Transformers

liked a dataset 15 days ago

AI-MO/NuminaMath-1.5

Organizations

None yet

upvoted a paper 8 days ago

Variable-Width Transformers

Paper • 2606.18246 • Published 14 days ago • 15

upvoted a paper about 1 month ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published May 13 • 165

upvoted a paper about 2 months ago

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

Paper • 2604.26752 • Published Apr 29 • 112

upvoted 3 papers 2 months ago

Lyra 2.0: Explorable Generative 3D Worlds

Paper • 2604.13036 • Published Apr 14 • 41

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 113

Self-Distillation Zero: Self-Revision Turns Binary Rewards into Dense Supervision

Paper • 2604.12002 • Published Apr 13 • 12

upvoted a paper 3 months ago

Vision2Web: A Hierarchical Benchmark for Visual Website Development with Agent Verification

Paper • 2603.26648 • Published Mar 27 • 46

upvoted an article 3 months ago

Welcome Gemma 4: Frontier multimodal intelligence on device

Article • merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 910

upvoted a paper 4 months ago

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published Feb 17 • 192

upvoted 2 papers 5 months ago

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published Feb 11 • 201

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 196

upvoted 5 papers 6 months ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 233

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Paper • 2512.24618 • Published Dec 31, 2025 • 155

Latent Implicit Visual Reasoning

Paper • 2512.21218 • Published Dec 24, 2025 • 70

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

Paper • 2512.16093 • Published Dec 18, 2025 • 97

Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers

Paper • 2512.17351 • Published Dec 19, 2025 • 29

upvoted a collection 6 months ago

Qwen3-VL

Collection • 37 items • Updated Dec 31, 2025 • 749

upvoted a paper 6 months ago

Olmo 3

Paper • 2512.13961 • Published Dec 15, 2025 • 36

upvoted 2 papers 7 months ago

CaptionQA: Is Your Caption as Useful as the Image Itself?

Paper • 2511.21025 • Published Nov 26, 2025 • 29

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published Nov 27, 2025 • 96
