james curry

ainbo

AI & ML interests

None yet

Recent Activity

upvoted a paper about 9 hours ago

Causal-rCM: A Unified Teacher-Forcing and Self-Forcing Open Recipe for Autoregressive Diffusion Distillation in Streaming Video Generation and Interactive World Models

Paper • 2606.25473 • Published 1 day ago • 10

upvoted a paper 3 days ago

PerceptionDLM: Parallel Region Perception with Multimodal Diffusion Language Models

Paper • 2606.19534 • Published 9 days ago • 60

upvoted a paper 9 days ago

JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

Paper • 2606.14777 • Published 16 days ago • 201

upvoted a paper 16 days ago

Echo-Memory: A Controlled Study of Memory in Action World Models

Paper • 2606.09803 • Published 18 days ago • 32

upvoted a paper 21 days ago

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published 25 days ago • 135

upvoted 3 papers 28 days ago

upvoted a paper 29 days ago

LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence

Paper • 2605.25979 • Published May 25 • 27

upvoted 7 papers about 1 month ago

Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models

Paper • 2605.21573 • Published May 20 • 111

Self-Distilled Agentic Reinforcement Learning

Paper • 2605.15155 • Published May 14 • 115

MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory

Paper • 2605.15128 • Published May 14 • 64

Qwen-Image-VAE-2.0 Technical Report

Paper • 2605.13565 • Published May 13 • 62

AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation

Paper • 2605.13724 • Published May 13 • 105

CausalCine: Real-Time Autoregressive Generation for Multi-Shot Video Narratives

Paper • 2605.12496 • Published May 12 • 30

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Paper • 2605.12500 • Published May 12 • 194

upvoted 4 papers about 2 months ago

HumanNet: Scaling Human-centric Video Learning to One Million Hours

Paper • 2605.06747 • Published May 7 • 55

Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published May 7 • 84

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Paper • 2604.28185 • Published Apr 30 • 92

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

Paper • 2604.26752 • Published Apr 29 • 112
