Daeun Lee

danaleee

https://daeunni.github.io/

Recent Activity

published a model about 5 hours ago

danaleee/VisionCoach

upvoted a paper 3 months ago

EgoLCD: Egocentric Video Generation with Long Context Diffusion

upvoted a paper 3 months ago

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

upvoted 4 papers 3 months ago

EgoLCD: Egocentric Video Generation with Long Context Diffusion

Paper • 2512.04515 • Published Dec 4, 2025 • 6

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Paper • 2512.05111 • Published Dec 4, 2025 • 50

PRInTS: Reward Modeling for Long-Horizon Information Seeking

Paper • 2511.19314 • Published Nov 24, 2025 • 8

StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos

Paper • 2512.01707 • Published Dec 1, 2025 • 8

upvoted a paper 5 months ago

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Paper • 2510.09608 • Published Oct 10, 2025 • 52

upvoted a paper 9 months ago

Video-Skill-CoT: Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning

Paper • 2506.03525 • Published Jun 4, 2025 • 6

upvoted a paper 10 months ago

A Survey of Interactive Generative Video

Paper • 2504.21853 • Published Apr 30, 2025 • 46

upvoted 2 papers 11 months ago

Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark

Paper • 2504.13143 • Published Apr 17, 2025 • 7

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14, 2025 • 306

upvoted 2 papers about 1 year ago

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective

Paper • 2502.14296 • Published Feb 20, 2025 • 45

TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space

Paper • 2501.12224 • Published Jan 21, 2025 • 48

upvoted 2 papers over 1 year ago

BECoTTA: Input-dependent Online Blending of Experts for Continual Test-time Adaptation

Paper • 2402.08712 • Published Feb 13, 2024 • 1

VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement

Paper • 2411.15115 • Published Nov 22, 2024 • 10