Walter Hugo Lopez Pinaya's picture

Walter Hugo Lopez Pinaya

Warvito

·

AI & ML interests

None yet

Recent Activity

updated a collection about 10 hours ago

updated a collection 8 days ago

upvoted a paper 8 days ago

Next-Latent Prediction Transformers Learn Compact World Models

View all activity

Organizations

upvoted a paper 8 days ago

Next-Latent Prediction Transformers Learn Compact World Models

Paper • 2511.05963 • Published Nov 8, 2025 • 3

upvoted a paper 9 days ago

MilliVid: Hierarchical Latents for Long-Range Consistency in Video Generation

Paper • 2606.09056 • Published 19 days ago • 6

upvoted a paper 25 days ago

Bernini: Latent Semantic Planning for Video Diffusion

Paper • 2605.22344 • Published May 21 • 19

upvoted 5 papers about 1 month ago

Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models

Paper • 2605.21573 • Published May 20 • 111

PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion

Paper • 2605.23902 • Published May 22 • 46

Qwen-Image-VAE-2.0 Technical Report

Paper • 2605.13565 • Published May 13 • 62

Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs

Paper • 2603.16932 • Published Mar 14 • 91

Steerable Visual Representations

Paper • 2604.02327 • Published Apr 2 • 56

upvoted 4 papers about 2 months ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 224

Video Analysis and Generation via a Semantic Progress Function

Paper • 2604.22554 • Published Apr 24 • 64

Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation

Paper • 2604.24763 • Published Apr 27 • 71

Denoising, Fast and Slow: Difficulty-Aware Adaptive Sampling for Image Generation

Paper • 2604.19141 • Published Apr 21 • 1

upvoted 5 papers 2 months ago

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published Apr 22 • 244

DiffEM: Learning from Corrupted Data with Diffusion Models via Expectation Maximization

Paper • 2510.12691 • Published Dec 20, 2025 • 1

OmniWeaving: Towards Unified Video Generation with Free-form Composition and Reasoning

Paper • 2603.24458 • Published Mar 25 • 10

LPM 1.0: Video-based Character Performance Model

Paper • 2604.07823 • Published Apr 9 • 82

The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook

Paper • 2604.02029 • Published Apr 2 • 152

upvoted 3 papers 3 months ago

Towards a Medical AI Scientist

Paper • 2603.28589 • Published Mar 30 • 91

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 148

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published Mar 3 • 107