Chi Chen's picture

Chi Chen

carboncoo

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Does Seeing More Mean Knowing More? Mono-Anchored Advantage Normalization for Multi-Source Visual Reasoning

upvoted a paper 3 months ago

Cheers: Decoupling Patch Details from Semantic Representations Enables Unified Multimodal Comprehension and Generation

upvoted a paper 4 months ago

Imagination Helps Visual Reasoning, But Not Yet in Latent Space

View all activity

Organizations

commented 2 papers about 1 year ago

MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding

Paper • 2505.20715 • Published May 27, 2025 • 2 •

AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization

Paper • 2503.23733 • Published Mar 31, 2025 • 10 •

commented 2 papers over 1 year ago

DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding

Paper • 2503.12797 • Published Mar 17, 2025 • 32 •

Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models

Paper • 2501.05767 • Published Jan 10, 2025 • 29 •