Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing Paper β’ 2603.03143 β’ Published 10 days ago β’ 133
Running 3.74k The Ultra-Scale Playbook π 3.74k The ultimate guide to training LLM on large GPU Clusters
Helios: Real Real-Time Long Video Generation Model Paper β’ 2603.04379 β’ Published 8 days ago β’ 159
Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control Paper β’ 2602.18422 β’ Published 20 days ago β’ 30
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 21 days ago β’ 483
PaperBanana: Automating Academic Illustration for AI Scientists Paper β’ 2601.23265 β’ Published Jan 30 β’ 216
Latent Diffusion Model without Variational Autoencoder Paper β’ 2510.15301 β’ Published Oct 17, 2025 β’ 49
Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset Paper β’ 2510.15742 β’ Published Oct 17, 2025 β’ 51
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI Paper β’ 2510.05684 β’ Published Oct 7, 2025 β’ 143
Lynx: Towards High-Fidelity Personalized Video Generation Paper β’ 2509.15496 β’ Published Sep 19, 2025 β’ 13
JAM-Flow: Joint Audio-Motion Synthesis with Flow Matching Paper β’ 2506.23552 β’ Published Jun 30, 2025 β’ 10
Running on Zero MCP Featured 323 Chain-of-Zoom π 323 Extreme Super-Resolution via Scale Autoregression
Seeing Voices: Generating A-Roll Video from Audio with Mirage Paper β’ 2506.08279 β’ Published Jun 9, 2025 β’ 27
Seeing Voices: Generating A-Roll Video from Audio with Mirage Paper β’ 2506.08279 β’ Published Jun 9, 2025 β’ 27 β’ 2
Seeing Voices: Generating A-Roll Video from Audio with Mirage Paper β’ 2506.08279 β’ Published Jun 9, 2025 β’ 27
Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features Paper β’ 2504.00557 β’ Published Apr 1, 2025 β’ 15 β’ 2