Spaces: MCP-1st-Birthday/CV_MCP_Server

OppaAI committed on Nov 30, 2025

Commit 0bc6e46 (verified) · 1 parent: 70d08b0

Update README.md

Files changed (1)

README.md +36 -54

@@ -8,86 +8,68 @@ sdk_version: 6.0.1
 app_file: app.py
 pinned: false
 license: mit
-short_description: CV VLM MCP Server for MCP 1st Birthday party Hackathon
 tags:
 - building-mcp-track-creative
 ---
-Check out the Hackathon details at: https://huggingface.co/MCP-1st-Birthday
-Social media post link:
-https://discord.com/channels/879548962464493619/
-# 🎥 Robot Vision MCP Server
-A Model Context Protocol (MCP) server that provides real-time scene analysis for webcam images. This Space allows users to stream webcam feeds and get detailed information about the environment, objects, humans, and more.
 ---
-## 🌟 Features
-- Real-time scene description
-- Human detection
-- Animals and objects detection
-- Environment classification (indoor/outdoor)
-- Lighting condition analysis
-- Hazards identification
-- Optimized for context window efficiency
 ---
 ## 🔧 How It Works
-The Space uses a MCP server to analyze images captured from your webcam. When an image is streamed:
-1. The image is sent to the MCP server.
-2. The server processes it using the Model Context Protocol.
-3. Outputs are returned and displayed in the UI, including:
-    - General description of the scene
-    - Detected humans
-    - Animals and objects
-    - Environment type (indoor/outdoor)
-    - Lighting condition
-    - Hazards
 ---
 ## ⚡ Demo
-Open the Space and use your webcam to test the real-time scene analysis.
----
-## 🔑 Requirements
-- A valid **Hugging Face API Token** is required.
-- Ensure you set your token as an environment variable `HF_TOKEN` if running locally.
-> **Note:** This project is meant to demonstrate MCP-based vision tools. It was created for educational purposes, and the MCP server may have usage limits.
----
-## 📚 References
-Check out the Hugging Face configuration reference for Spaces: [Spaces Config Reference](https://huggingface.co/docs/hub/spaces-config-reference)
 ---
-## 🚀 Usage
-1. Click the webcam feed in the CV MCP Client: https://huggingface.co/spaces/MCP-1st-Birthday/CV_MCP_Client
-2. The Space will display real-time outputs in the provided textboxes:
-    - Description
-    - Environment
-    - Indoor/Outdoor
-    - Lighting Condition
-    - Human Detected
-    - Animals Detected
-    - Objects Detected
-    - Hazards Identified
----
-## ⚠️ Note
-This project was created as a demo for an MCP-based vision server. While fully functional, heavy usage may incur resource limits. Feel free to explore the code to understand how the MCP server processes webcam feeds and returns detailed scene analysis.

 app_file: app.py
 pinned: false
 license: mit
+short_description: Real-time CV VLM MCP Server for MCP 1st Birthday Hackathon
 tags:
 - building-mcp-track-creative
 ---
+# 🎥 Robot Vision MCP Server
+A **Model Context Protocol (MCP) server** that provides **real-time scene analysis** for webcam images.
+This Space allows users to stream live video feeds and get detailed insights about the environment, objects, humans, and more.
+Check out the Hackathon details [here](https://huggingface.co/MCP-1st-Birthday).
+Join the community discussion on [Discord](https://discord.com/channels/879548962464493619/).
+🎬 Watch a demo of the CV MCP Server analyzing a robot’s camera feed:
+[Demo Video](https://photos.app.goo.gl/guxui1EsdPNoL4mw7)
 ---
+## 🌟 Key Features
+- Real-time scene description
+- Human detection
+- Animal and object detection
+- Environment classification (indoor/outdoor)
+- Lighting condition analysis
+- Hazards identification
+- Optimized for **context window efficiency**
 ---
 ## 🔧 How It Works
+This Space leverages the MCP server to analyze images captured from your webcam:
+1. Stream an image from your webcam.
+2. The image is sent to the **MCP server**.
+3. The server processes it using the **Model Context Protocol**.
+4. Outputs are returned and displayed in the UI, including:
+   - General scene description
+   - Detected humans
+   - Detected animals and objects
+   - Environment type (indoor/outdoor)
+   - Lighting condition
+   - Hazards
 ---
 ## ⚡ Demo
+Compatible with **PC, mobile, and robots with cameras**.
+Stream images via your webcam or phone camera to receive real-time scene analysis.
+Watch a demo video of the CV MCP Server analyzing the video feed from my robot:
+[Demo Video](https://photos.app.goo.gl/guxui1EsdPNoL4mw7)
 ---
+## 🔑 Requirements
+- A valid **Hugging Face API Token** is required.
+- If running locally, set your token as an environment variable:
+```bash
+export HF_TOKEN=your_huggingface_token