Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing Paper β’ 2603.03143 β’ Published 9 days ago β’ 133
Helios: Real Real-Time Long Video Generation Model Paper β’ 2603.04379 β’ Published 8 days ago β’ 159
Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control Paper β’ 2602.18422 β’ Published 20 days ago β’ 30
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 21 days ago β’ 483
PaperBanana: Automating Academic Illustration for AI Scientists Paper β’ 2601.23265 β’ Published Jan 30 β’ 216
Latent Diffusion Model without Variational Autoencoder Paper β’ 2510.15301 β’ Published Oct 17, 2025 β’ 49
Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset Paper β’ 2510.15742 β’ Published Oct 17, 2025 β’ 51
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI Paper β’ 2510.05684 β’ Published Oct 7, 2025 β’ 143
Lynx: Towards High-Fidelity Personalized Video Generation Paper β’ 2509.15496 β’ Published Sep 19, 2025 β’ 13
JAM-Flow: Joint Audio-Motion Synthesis with Flow Matching Paper β’ 2506.23552 β’ Published Jun 30, 2025 β’ 10
Seeing Voices: Generating A-Roll Video from Audio with Mirage Paper β’ 2506.08279 β’ Published Jun 9, 2025 β’ 27
Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features Paper β’ 2504.00557 β’ Published Apr 1, 2025 β’ 15
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation Paper β’ 2503.09641 β’ Published Mar 12, 2025 β’ 42
SANA-Sprint Collection πSANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation β’ 6 items β’ Updated 3 days ago β’ 43
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization Paper β’ 2412.17739 β’ Published Dec 23, 2024 β’ 41
FastVLM: Efficient Vision Encoding for Vision Language Models Paper β’ 2412.13303 β’ Published Dec 17, 2024 β’ 75
FashionComposer: Compositional Fashion Image Generation Paper β’ 2412.14168 β’ Published Dec 18, 2024 β’ 17