Kevin

kvnptl

AI & ML interests

Robot perception

Recent Activity

liked a Space 9 days ago

lerobot/visualize_dataset

updated a dataset 9 days ago

kvnptl/so101-teleop-vials-to-rack-real

published a dataset 9 days ago

kvnptl/so101-teleop-vials-to-rack-real

Organizations

None yet

liked a Space 9 days ago

Visualize Dataset (v2.0+ latest dataset format)

Visualize LeRobot datasets with interactive charts and tools

updated a dataset 9 days ago

kvnptl/so101-teleop-vials-to-rack-real

Viewer • Updated 9 days ago • 106k • 209

published a dataset 9 days ago

kvnptl/so101-teleop-vials-to-rack-real

Viewer • Updated 9 days ago • 106k • 209

liked a model 14 days ago

nvidia/LocateAnything-3B

Image-Text-to-Text • 4B • Updated 14 days ago • 495k • 2.37k

updated a dataset 21 days ago

kvnptl/so101-teleop-ballpen-to-rack-real

Viewer • Updated 21 days ago • 54.8k • 102

published a dataset 21 days ago

kvnptl/so101-teleop-ballpen-to-rack-real

Viewer • Updated 21 days ago • 54.8k • 102

liked 2 models 3 months ago

nvidia/Cosmos-Predict2.5-2B

Updated Mar 3 • 79.2k • 136

nvidia/Cosmos-Guardrail1

Updated Apr 1, 2025 • 2.44k • 27

liked a model 4 months ago

nvidia/Cosmos-Embed1-448p

1B • Updated Mar 13 • 10.9k • 12

upvoted 3 articles 4 months ago

Article

Deploying Open Source Vision Language Models (VLM) on Jetson

nvidia • Feb 24 • 37

Article

NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI

nvidia • Jan 5 • 64

Article

Introducing NVIDIA Cosmos Policy for Advanced Robot Control

nvidia • Jan 29 • 48

liked a model 8 months ago

facebook/EdgeTAM

Updated Apr 30, 2025 • 31

liked a Space 8 months ago

HunyuanWorld-Mirror

Universal 3D World Reconstruction with Any Prior Prompting

liked a Space about 1 year ago

RF-DETR

SOTA real-time object detection model

upvoted an article over 1 year ago

Article

SmolVLM - small yet mighty Vision Language Model

andito, merve, mfarre, eliebak, pcuenq • Nov 26, 2024 • 419

reacted to maxiw's post with 👍 over 1 year ago

Post

3930

The new Qwen-2 VL models seem to perform quite well in object detection. You can prompt them to respond with bounding boxes in a reference frame of 1k x 1k pixels and scale those boxes to the original image size.

You can try it out with my space maxiw/Qwen2-VL-Detection
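
The rescaling step described in the post can be sketched as follows. This is a minimal illustration, not the code used in the Space: it assumes boxes arrive as (x1, y1, x2, y2) in a 1000 x 1000 reference frame, and the function name `scale_box` and its parameters are hypothetical.

```python
def scale_box(box, image_width, image_height, ref_size=1000):
    """Map a box from a square reference frame to original image pixels.

    Assumptions (not confirmed by the post): the box is (x1, y1, x2, y2)
    and the reference frame is ref_size x ref_size.
    """
    x1, y1, x2, y2 = box
    # Each axis gets its own scale factor, since the original image
    # is usually not square.
    scale_x = image_width / ref_size
    scale_y = image_height / ref_size
    return (x1 * scale_x, y1 * scale_y, x2 * scale_x, y2 * scale_y)


# Example: a box in the reference frame mapped onto a 2000 x 1000 image.
scaled = scale_box((100, 200, 500, 800), image_width=2000, image_height=1000)
```

Scaling each axis independently matters because the reference frame is square while the original image generally is not.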

6 replies

upvoted an article over 1 year ago

Article

Welcome PaliGemma 2 – New vision language models by Google

merve, andsteing, pcuenq, ariG23498 • Dec 5, 2024 • 166

liked 2 datasets over 1 year ago

uoft-cs/cifar10

Viewer • Updated Jan 4, 2024 • 60k • 178k • 107

Francesco/animals-ij5d2

Viewer • Updated Mar 30, 2023 • 1k • 159 • 16