kkaka

kkakkkka

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Lance: Unified Multimodal Modeling by Multi-Task Synergy

upvoted a paper about 1 month ago

LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation

upvoted a paper about 2 months ago

KVPO: ODE-Native GRPO for Autoregressive Video Alignment via KV Semantic Exploration

View all activity

Organizations

upvoted 2 papers about 1 month ago

Lance: Unified Multimodal Modeling by Multi-Task Synergy

Paper • 2605.18678 • Published May 18 • 79

LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation

Paper • 2605.18739 • Published May 18 • 116

upvoted a paper about 2 months ago

KVPO: ODE-Native GRPO for Autoregressive Video Alignment via KV Semantic Exploration

Paper • 2605.14278 • Published May 14 • 37

upvoted a paper 3 months ago

ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?

Paper • 2603.25823 • Published Mar 26 • 44

upvoted 3 papers 7 months ago

MIND-V: Hierarchical Video Generation for Long-Horizon Robotic Manipulation with RL-based Physical Alignment

Paper • 2512.06628 • Published Dec 7, 2025 • 13

AnyTalker: Scaling Multi-Person Talking Video Generation with Interactivity Refinement

Paper • 2511.23475 • Published Nov 28, 2025 • 43

Controllable Layer Decomposition for Reversible Multi-Layer Image Generation

Paper • 2511.16249 • Published Nov 20, 2025 • 9

upvoted 2 papers 9 months ago

GIR-Bench: Versatile Benchmark for Generating Images with Reasoning

Paper • 2510.11026 • Published Oct 13, 2025 • 18

Hyper-Bagel: A Unified Acceleration Framework for Multimodal Understanding and Generation

Paper • 2509.18824 • Published Sep 23, 2025 • 23

upvoted a paper 12 months ago

SpeakerVid-5M: A Large-Scale High-Quality Dataset for Audio-Visual Dyadic Interactive Human Generation

Paper • 2507.09862 • Published Jul 14, 2025 • 52