신 가은

zhuzixuan

AI & ML interests

Embodied AI and robotics prototypes.

Recent Activity

upvoted a paper about 8 hours ago

Optimizing Visual Generative Models via Distribution-wise Rewards

liked a dataset 13 days ago

coffeelake6/leroobot_sample_20260620_215902

upvoted a paper 13 days ago

LoopCoder-v2: Only Loop Once for Efficient Test-Time Computation Scaling

View all activity

Organizations

None yet

upvoted a paper about 8 hours ago

Optimizing Visual Generative Models via Distribution-wise Rewards

Paper • 2607.02291 • Published 2 days ago • 12

upvoted a paper 13 days ago

LoopCoder-v2: Only Loop Once for Efficient Test-Time Computation Scaling

Paper • 2606.18023 • Published 18 days ago • 209

upvoted 4 papers about 1 month ago

Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs

Paper • 2605.30611 • Published May 28 • 250

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published May 20 • 207

Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers

Paper • 2605.06169 • Published May 7 • 238

Realiz3D: 3D Generation Made Photorealistic via Domain-Aware Learning

Paper • 2605.13852 • Published Mar 25 • 26

upvoted 2 papers about 2 months ago

PianoCoRe: Combined and Refined Piano MIDI Dataset

Paper • 2605.06627 • Published May 7 • 7

Scaling Continual Learning to 300+ Tasks with Bi-Level Routing Mixture-of-Experts

Paper • 2602.03473 • Published May 8 • 11

upvoted 7 papers 3 months ago

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 509

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 638

ACES: Who Tests the Tests? Leave-One-Out AUC Consistency for Code Generation

Paper • 2604.03922 • Published Apr 5 • 53

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 344

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 353

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

Paper • 2602.11748 • Published Feb 12 • 38

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

Paper • 2603.17051 • Published Mar 17 • 109

upvoted 3 papers 4 months ago

Efficient Reasoning with Balanced Thinking

Paper • 2603.12372 • Published Mar 12 • 151

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 373

SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models

Paper • 2603.16859 • Published Mar 17 • 248