Spaces:

Hanze-Qiu
/

Assignment_2

Build error

Assignment_2 / README.md

Upload 6 files

ea9e3c0 verified about 1 month ago

1.45 kB

title: Car Parts Image-to-Video Retrieval
emoji: 🚗
colorFrom: blue
colorTo: green
sdk: docker
pinned: false
license: mit

Car Parts Image-to-Video Retrieval System

An intelligent system that detects car parts in images and retrieves matching video clips from an indexed automotive video.

YOLOv26s Detection: Fine-tuned on car parts dataset
Semantic Matching: Identifies doors, wheels, headlights, mirrors, bumpers, and more
Temporal Retrieval: Returns precise video clip timestamps
Interactive Demo: Upload any car image and find matching video segments

Model: YOLOv26s (small variant) fine-tuned for car part detection
Video Index: Pre-computed detection index with bounding boxes and timestamps
Sampling Strategy: Every 5th frame (4.8-6 FPS effective rate)
Clip Formation: 3.0s gap threshold for temporal merging

This demo is part of Assignment 2 for CS-UY 4613 Artificial Intelligence (Spring 2026).

Student: Hanze (James) Qiu
Repository: github.com/JamesQiu2005/CS-UY_4613_Assignments

Built with Ultralytics YOLO, OpenCV, and Gradio.