Vision-Language-Action models for end-to-end robotic control. SmolVLA, RDT2-FM action generation.
AI & ML interests
None defined yet.
Recent Activity
INT4 vision-language models for robotic scene understanding. Qwen2.5-VL for visual QA and grounding.
INT8 quantized vision models for real-time robotic perception. SAM2, DINOv2, CLIP, SigLIP, Depth Anything.
-
robotflowlabs/clip-vit-large-patch14-int8
Zero-Shot Image Classification • Updated • 11 -
robotflowlabs/sam2.1-hiera-large-int8
Image Segmentation • Updated • 12 -
robotflowlabs/sam2.1-hiera-small-int8
Image Segmentation • Updated • 11 -
robotflowlabs/sam2.1-hiera-tiny-int8
Image Segmentation • Updated • 12
Vision-Language-Action models for end-to-end robotic control. SmolVLA, RDT2-FM action generation.
INT4 vision-language models for robotic scene understanding. Qwen2.5-VL for visual QA and grounding.
INT4 quantized language models for robotic reasoning. Qwen2.5, SmolLM2 optimized for edge deployment.
INT8 quantized vision models for real-time robotic perception. SAM2, DINOv2, CLIP, SigLIP, Depth Anything.
-
robotflowlabs/clip-vit-large-patch14-int8
Zero-Shot Image Classification • Updated • 11 -
robotflowlabs/sam2.1-hiera-large-int8
Image Segmentation • Updated • 12 -
robotflowlabs/sam2.1-hiera-small-int8
Image Segmentation • Updated • 11 -
robotflowlabs/sam2.1-hiera-tiny-int8
Image Segmentation • Updated • 12