-
MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization
Paper • 2504.00999 • Published • 96 -
Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation
Paper • 2503.24379 • Published • 76 -
Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1
Paper • 2503.24376 • Published • 38 -
A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond
Paper • 2503.21614 • Published • 43
Jianfeng Hua
jianfenghua
AI & ML interests
None yet
Recent Activity
upvoted a paper about 6 hours ago
CARE-Edit: Condition-Aware Routing of Experts for Contextual Image Editing upvoted a paper about 6 hours ago
SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale upvoted a paper about 6 hours ago
Beyond Language Modeling: An Exploration of Multimodal Pretraining Organizations
None yet