Spaces:
Sleeping
A newer version of the Gradio SDK is available:
6.2.0
title: CV MCP Server
emoji: π»
colorFrom: yellow
colorTo: blue
sdk: gradio
sdk_version: 6.0.1
app_file: app.py
pinned: false
license: mit
short_description: Real-time CV VLM MCP Server for MCP 1st Birthday Hackathon
tags:
- building-mcp-track-creative
π₯ CV MCP Server
A Model Context Protocol (MCP) server that provides real-time scene analysis for webcam images.
This Space allows users to stream live video feeds and get detailed insights about the environment, objects, humans, and more.
Check out the Hackathon details here.
The social media post of this MCP Hackthon project on Discord Instagram Thread.
π¬ Watch a demo of the CV MCP Server analyzing a robotβs camera feed:
Demo Video
GitHub repo of the cv robot python script: https://github.com/OppaAI/CV_Robot_MCP
π Key Features
- Real-time scene description
- Human detection
- Animal and object detection
- Environment classification (indoor/outdoor)
- Lighting condition analysis
- Hazards identification
- Optimized for context window efficiency
π§ How It Works
This Space leverages the MCP server to analyze images captured from your webcam:
- Stream an image from your webcam.
- The image is sent to the MCP server.
- The server processes it using the Model Context Protocol.
- Outputs are returned and displayed in the UI, including:
- General scene description
- Detected humans
- Detected animals and objects
- Environment type (indoor/outdoor)
- Lighting condition
- Hazards
β‘ Demo
Compatible with PC, mobile, and robots with cameras.
Stream images via your webcam or phone camera to receive real-time scene analysis.
Watch a demo video of the CV MCP Server analyzing the video feed from my robot:
Demo Video
π Requirements
- A valid Hugging Face API Token is required.
- If running locally, set your token as an environment variable:
export HF_TOKEN=your_huggingface_token