Zhen Fang

CostaliyA

9 97 4

https://costaliya.github.io/

CostaliyA

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

In-Context World Modeling for Robotic Control

upvoted a paper 2 days ago

DanceOPD: On-Policy Generative Field Distillation

upvoted a paper 4 days ago

Qwen-AgentWorld: Language World Models for General Agents

View all activity

Organizations

None yet

upvoted a paper 1 day ago

In-Context World Modeling for Robotic Control

Paper • 2606.26025 • Published 5 days ago • 57

upvoted a paper 2 days ago

DanceOPD: On-Policy Generative Field Distillation

Paper • 2606.27377 • Published 5 days ago • 73

upvoted a paper 4 days ago

Qwen-AgentWorld: Language World Models for General Agents

Paper • 2606.24597 • Published 7 days ago • 139

upvoted a paper 10 days ago

DragMesh-2: Physically Plausible Dexterous Hand-Object Interaction with Articulated Objects

Paper • 2606.15133 • Published 17 days ago • 74

upvoted a paper 13 days ago

JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

Paper • 2606.14777 • Published 20 days ago • 205

upvoted 2 papers 14 days ago

HYDRA-X: Native Unified Multimodal Models with Holistic Visual Tokenizers

Paper • 2606.13289 • Published 19 days ago • 29

Hy-Embodied-0.5-VLA: From Vision-Language-Action Models to a Real-World Robot Learning Stack

Paper • 2606.14409 • Published 18 days ago • 15

upvoted 2 papers 17 days ago

i1: A Simple and Fully Open Recipe for Strong Text-to-Image Models

Paper • 2606.11289 • Published 21 days ago • 16

InterleaveThinker: Reinforcing Agentic Interleaved Generation

Paper • 2606.13679 • Published 19 days ago • 82

upvoted a paper 18 days ago

EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments

Paper • 2606.13681 • Published 19 days ago • 142

upvoted 3 papers 19 days ago

Struct-Searcher: Agentic Structural Thinking Advances Multimodal Deep Information Seeking

Paper • 2606.07689 • Published 25 days ago • 5

MemDreamer: Decoupling Perception and Reasoning for Long Video Understanding via Hierarchical Graph Memory and Agentic Retrieval Mechanism

Paper • 2606.07512 • Published 25 days ago • 39

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Paper • 2606.11025 • Published 21 days ago • 41

upvoted a paper 20 days ago

SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks

Paper • 2606.09669 • Published 22 days ago • 46

upvoted a paper 23 days ago

LoomVideo: Unifying Multimodal Inputs into Video Generation and Editing

Paper • 2606.06042 • Published 26 days ago • 24

upvoted a paper 24 days ago

VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding

Paper • 2606.05259 • Published 27 days ago • 39

upvoted 4 papers 25 days ago

Filter, Then Reweight: Rethinking Optimization Granularity in On-Policy Distillation

Paper • 2606.02684 • Published 29 days ago • 16

Echo-Infinity: Learning Evolving Memory for Real-Time Infinite Video Generation

Paper • 2606.04527 • Published 27 days ago • 28

Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories

Paper • 2606.02060 • Published 29 days ago • 57

Qwen-Image-Flash: Beyond Objective Design

Paper • 2606.03746 • Published 28 days ago • 36

Zhen Fang

AI & ML interests

Recent Activity

Organizations

CostaliyA's activity