World-Language-Action Model for Unified World Modeling, Language Reasoning, and Action Synthesis Paper • 2606.05979 • Published 27 days ago • 9
ProductWebGen: Benchmarking Multimodal Product Webpage Generation Paper • 2606.01022 • Published May 31 • 5
LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model Paper • 2604.02097 • Published Apr 2 • 32
Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders Paper • 2601.10332 • Published Jan 15 • 32
LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding Paper • 2512.16229 • Published Dec 18, 2025 • 17
Fast and Accurate Causal Parallel Decoding using Jacobi Forcing Paper • 2512.14681 • Published Dec 16, 2025 • 44
Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight Paper • 2511.16175 • Published Nov 20, 2025 • 12
Diffusion LLMs Can Do Faster-Than-AR Inference via Discrete Diffusion Forcing Paper • 2508.09192 • Published Aug 8, 2025 • 31
LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks Paper • 2506.00411 • Published May 31, 2025 • 32
Done Is Better than Perfect: Unlocking Efficient Reasoning by Structured Multi-Turn Decomposition Paper • 2505.19788 • Published May 26, 2025 • 13