Tong Zhu

Spico

https://Spico197.github.io

AI & ML interests

Information Extraction, Mixture-of-Experts, LLM

Recent Activity

liked a model 15 days ago

Xiaoye08/HRM-MoE

upvoted a paper 17 days ago

SceneCode: Executable World Programs for Editable Indoor Scenes with Articulated Objects

Paper • 2605.19587 • Published May 19 • 10

upvoted a paper 23 days ago

Draft-OPD: On-Policy Distillation for Speculative Draft Models

Paper • 2605.29343 • Published 28 days ago • 36

upvoted 2 papers about 1 month ago

ACC: Compiling Agent Trajectories for Long-Context Training

Paper • 2605.21850 • Published May 21 • 60

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published May 13 • 165

upvoted 2 papers 2 months ago

TEMPO: Scaling Test-time Training for Large Reasoning Models

Paper • 2604.19295 • Published Apr 21 • 35

PlayCoder: Making LLM-Generated GUI Code Playable

Paper • 2604.19742 • Published Apr 21 • 26

upvoted a paper 3 months ago

GEMS: Agent-Native Multimodal Generation with Memory and Skills

Paper • 2603.28088 • Published Mar 30 • 87

upvoted 2 articles 4 months ago

Your MoE Model Does Not Have to Select Fixed Number of Experts

Article • Spico • Feb 26 • 7

Transformers v5: Simple model definitions powering the AI ecosystem

Article • lysandre, ArthurZ, cyrilvallez, reach-vb • Dec 1, 2025 • 311

upvoted a paper 4 months ago

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Paper • 2602.09443 • Published Feb 10 • 59

upvoted 5 papers 5 months ago

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

Paper • 2602.05885 • Published Feb 5 • 28

AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning

Paper • 2601.18631 • Published Jan 26 • 48

MemoryRewardBench: Benchmarking Reward Models for Long-Term Memory Management in Large Language Models

Paper • 2601.11969 • Published Jan 17 • 27

Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey

Paper • 2601.11655 • Published Jan 15 • 63

Toward Efficient Agents: Memory, Tool learning, and Planning

Paper • 2601.14192 • Published Jan 20 • 57

upvoted an article 5 months ago

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Article • loubnabnl, anton-l, davanstrien • Mar 20, 2024 • 114

upvoted a paper 6 months ago

DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models

Paper • 2512.24165 • Published Dec 30, 2025 • 52

upvoted 3 papers 7 months ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 128

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 247

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 135
