UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors Paper • 2605.00658 • Published 7 days ago • 80
Beyond the Exploration-Exploitation Trade-off: A Hidden State Approach for LLM Reasoning in RLVR Paper • 2509.23808 • Published Sep 28, 2025 • 47
Light of Normals: Unified Feature Representation for Universal Photometric Stereo Paper • 2506.18882 • Published Jun 23, 2025 • 89
DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation Paper • 2505.18078 • Published May 23, 2025 • 6
Soulstyler: Using Large Language Model to Guide Image Style Transfer for Target Object Paper • 2311.13562 • Published Nov 22, 2023 • 1
ZhuJiu: A Multi-dimensional, Multi-faceted Chinese Benchmark for Large Language Models Paper • 2308.14353 • Published Aug 28, 2023
DRiVE: Diffusion-based Rigging Empowers Generation of Versatile and Expressive Characters Paper • 2411.17423 • Published Nov 26, 2024
Idea23D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs Paper • 2404.04363 • Published Apr 5, 2024
FLUX.1 [Schnell] 🏎 • Running on Zero • Agents • Featured • 5.06k • Generate images from text prompts with FLUX.1 Schnell
TRELLIS 🏢 • Running on Zero • Agents • Featured • 4.78k • Scalable and Versatile 3D Generation from images
Make It Animatable 💃 • Running on Zero • Agents • 98 • Authoring Animation-Ready 3D Characters with One Click