Peng Liu's picture

Peng Liu

P3ngLiu

·

P3ngLiu

AI & ML interests

CV, Multimodal, OVD

Recent Activity

upvoted an article about 5 hours ago

VLX-Flow: Continuous Video Understanding for Real-Time Multimodal Interaction

updated a collection 9 days ago

OmDet-Turbo-Models

updated a model 9 days ago

omlab/VLM-FO1-3B-v01

View all activity

Organizations

upvoted an article about 5 hours ago

Article

VLX-Flow: Continuous Video Understanding for Real-Time Multimodal Interaction

tianchez

•

about 5 hours ago

• 6

upvoted a paper 25 days ago

Which Pretraining Paradigm Better Serves Spatial Intelligence? An Empirical Comparison of Vision-Language and Video Generation Models

Paper • 2605.28132 • Published about 1 month ago • 25

upvoted 2 articles 11 months ago

Article

Improving Object Detection through Reinforcement Learning with VLM-R1

omlab

•

Mar 25, 2025

• 3

Article

Trials, Errors, and Breakthroughs: Our Rocky Road to OVD SOTA with Reinforcement Learning

omlab

•

Mar 25, 2025

• 3

upvoted a paper about 1 year ago

VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model

Paper • 2504.07615 • Published Apr 10, 2025 • 36