CV_MCP_Client / README.md
OppaAI's picture
Update README.md
eb0bfb4 verified

A newer version of the Gradio SDK is available: 6.2.0

Upgrade
metadata
title: CV MCP Client
emoji: 🐒
colorFrom: indigo
colorTo: red
sdk: gradio
sdk_version: 6.0.1
app_file: app.py
pinned: false
license: mit
short_description: Computer Vision MCP Client for MCP 1st Birthday Hackathon
tags:
  - mcp-in-action-track-creative

πŸŽ₯ CV MCP Client

A Computer Vision Model Context Protocol (MCP) Client that streams webcam images and provides detailed scene analysis in real time, designed for the MCP 1st Birthday Hackathon.

Check out the Hackathon details here.

The social media post of this MCP Hackthon project on Discord Instagram Thread.

Demo video of the CV MCP Server analyzing the video feed from my robot:
Demo Video

GitHub repo of the cv robot python script: https://github.com/OppaAI/CV_Robot_MCP


🌟 Key Features

  • Real-time scene description
  • Human detection
  • Animal and object detection
  • Environment classification (indoor/outdoor)
  • Lighting condition analysis
  • Hazards identification
  • Optimized for efficient context window usage

πŸ”§ How It Works

This Gradio Space interacts with an MCP server to analyze webcam images:

  1. Capture an image from your webcam.
  2. Convert the image to Base64 format.
  3. Send the image along with your Hugging Face API token to the MCP server.
  4. Receive detailed scene analysis, including:
    • Scene description
    • Detected humans
    • Detected animals and objects
    • Environment type (indoor/outdoor)
    • Lighting condition
    • Hazards

⚑ Demo

Compatible with PC, mobile, and robots with cameras.

Stream images via your webcam or phone camera to receive real-time scene analysis.

Watch a demo video of the CV MCP Server analyzing the video feed from my robot:
Demo Video


πŸ”‘ Requirements

  • Hugging Face API Token is required for MCP server access.
  • Set your token as an environment variable:
export HF_TOKEN=your_huggingface_token