1 49 11

Xuanlang Dai

XuanlangDai

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

upvoted a paper about 1 month ago

PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion

upvoted a paper about 1 month ago

ETCHR: Editing To Clarify and Harness Reasoning

View all activity

Organizations

None yet

upvoted a paper 7 days ago

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

Paper • 2606.19338 • Published 9 days ago • 46

upvoted 4 papers about 1 month ago

authored a paper about 1 month ago

WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation

Paper • 2605.10912 • Published May 11 • 46

liked a model about 1 month ago

internlm/Intern-S2-Preview

Image-Text-to-Text • 36B • Updated 27 days ago • 4.56k • 109

upvoted a paper about 1 month ago

WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation

Paper • 2605.10912 • Published May 11 • 46

upvoted 4 papers about 2 months ago

Flow-OPD: On-Policy Distillation for Flow Matching Models

Paper • 2605.08063 • Published May 8 • 102

Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published May 7 • 84

UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation

Paper • 2603.23500 • Published Mar 24 • 36

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

Paper • 2604.26752 • Published Apr 29 • 112

upvoted a paper 2 months ago

AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model

Paper • 2604.19747 • Published Apr 21 • 40

upvoted 2 papers 3 months ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 328

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

Paper • 2604.08377 • Published Apr 9 • 295

liked a model 3 months ago

internlm/Spatial-SSRL-Qwen3VL-4B

Image-Text-to-Text • 5B • Updated Apr 6 • 186 • 14

upvoted 4 papers 3 months ago

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

Paper • 2604.04771 • Published Apr 6 • 124

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Paper • 2604.04921 • Published Apr 6 • 116

SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing

Paper • 2604.04911 • Published Apr 6 • 36

GEMS: Agent-Native Multimodal Generation with Memory and Skills

Paper • 2603.28088 • Published Mar 30 • 87

Xuanlang Dai

AI & ML interests

Recent Activity

Organizations

XuanlangDai's activity