
Chi Chen

carboncoo

AI & ML interests

None yet

Recent Activity

upvoted a paper 15 days ago
Imagination Helps Visual Reasoning, But Not Yet in Latent Space
liked a model about 1 month ago
openbmb/MiniCPM-o-4_5
upvoted a paper about 2 months ago
MM-UAVBench: How Well Do Multimodal Large Language Models See, Think, and Plan in Low-Altitude UAV Scenarios?

Organizations

Machine Translation Group, Natural Language Processing Lab at Tsinghua University

commented a paper 10 months ago

MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding

Paper • 2505.20715 • Published May 27, 2025 • 2 • 2
commented 2 papers 12 months ago

AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization

Paper • 2503.23733 • Published Mar 31, 2025 • 10 • 3

DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding

Paper • 2503.12797 • Published Mar 17, 2025 • 32 • 2
commented a paper about 1 year ago

Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models

Paper • 2501.05767 • Published Jan 10, 2025 • 29 • 2