🔄 In a Training Loop

Franklin PRO

Franklinzhang

·

AI & ML interests

None yet

Recent Activity

updated a dataset 3 days ago

Franklinzhang/test-ge

published a dataset 3 days ago

Franklinzhang/test-ge

upvoted a paper 5 days ago

Video-MME-Logical: A Controlled Diagnostic Benchmark for Video Temporal-Logical Reasoning

View all activity

Organizations

upvoted 2 papers 5 days ago

Video-MME-Logical: A Controlled Diagnostic Benchmark for Video Temporal-Logical Reasoning

Paper • 2606.27828 • Published 9 days ago • 24

LiveEdit: Towards Real-Time Diffusion-Based Streaming Video Editing

Paper • 2606.26740 • Published 10 days ago • 81

upvoted a paper 26 days ago

Echo-Memory: A Controlled Study of Memory in Action World Models

Paper • 2606.09803 • Published 27 days ago • 32

upvoted a paper 28 days ago

WALL-WM: Carving World Action Modeling at the Event Joints

Paper • 2606.01955 • Published Jun 1 • 23

upvoted 2 papers about 1 month ago

Echo-Infinity: Learning Evolving Memory for Real-Time Infinite Video Generation

Paper • 2606.04527 • Published Jun 3 • 28

Soap2Soap: Long Cinematic Video Remaking via Multi-Agent Collaboration

Paper • 2605.17423 • Published May 17 • 34

upvoted 7 papers about 2 months ago

Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context

Paper • 2605.13831 • Published May 13 • 89

DiffusionOPD: A Unified Perspective of On-Policy Distillation in Diffusion Models

Paper • 2605.15055 • Published May 14 • 19

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

Paper • 2605.15178 • Published May 14 • 91

Causal Forcing++: Scalable Few-Step Autoregressive Diffusion Distillation for Real-Time Interactive Video Generation

Paper • 2605.15141 • Published May 14 • 96

MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation

Paper • 2512.18181 • Published May 7 • 87

Flow-OPD: On-Policy Distillation for Flow Matching Models

Paper • 2605.08063 • Published May 8 • 102

Geometric Context Transformer for Streaming 3D Reconstruction

Paper • 2604.14141 • Published Apr 15 • 25

upvoted a paper 2 months ago

UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors

Paper • 2605.00658 • Published May 1 • 86

upvoted 6 papers 3 months ago

ResAdapt: Adaptive Resolution for Efficient Multimodal Reasoning

Paper • 2603.28610 • Published Mar 30 • 20

Gen-Searcher: Reinforcing Agentic Search for Image Generation

Paper • 2603.28767 • Published Mar 30 • 58

The Y-Combinator for LLMs: Solving Long-Context Rot with λ-Calculus

Paper • 2603.20105 • Published Mar 20 • 37

Captain Safari: A World Engine

Paper • 2511.22815 • Published Nov 28, 2025 • 12

RealMaster: Lifting Rendered Scenes into Photorealistic Video

Paper • 2603.23462 • Published Mar 24 • 33

GameplayQA: A Benchmarking Framework for Decision-Dense POV-Synced Multi-Video Understanding of 3D Virtual Agents

Paper • 2603.24329 • Published Mar 25 • 28