Spaces:
Sleeping
Sleeping
Update README.md
Browse files
README.md
CHANGED
|
@@ -8,86 +8,68 @@ sdk_version: 6.0.1
|
|
| 8 |
app_file: app.py
|
| 9 |
pinned: false
|
| 10 |
license: mit
|
| 11 |
-
short_description: CV VLM MCP Server for MCP 1st Birthday
|
| 12 |
tags:
|
| 13 |
- building-mcp-track-creative
|
| 14 |
---
|
| 15 |
|
| 16 |
-
|
| 17 |
|
| 18 |
-
|
| 19 |
-
|
| 20 |
|
| 21 |
-
|
|
|
|
| 22 |
|
| 23 |
-
|
|
|
|
| 24 |
|
| 25 |
---
|
| 26 |
|
| 27 |
-
## π Features
|
| 28 |
|
| 29 |
-
- Real-time scene description
|
| 30 |
-
- Human detection
|
| 31 |
-
-
|
| 32 |
-
- Environment classification (indoor/outdoor)
|
| 33 |
-
- Lighting condition analysis
|
| 34 |
-
- Hazards identification
|
| 35 |
-
- Optimized for context window efficiency
|
| 36 |
|
| 37 |
---
|
| 38 |
|
| 39 |
## π§ How It Works
|
| 40 |
|
| 41 |
-
|
| 42 |
|
| 43 |
-
1.
|
| 44 |
-
2. The
|
| 45 |
-
3.
|
| 46 |
-
|
| 47 |
-
|
| 48 |
-
|
| 49 |
-
|
| 50 |
-
|
| 51 |
-
|
|
|
|
| 52 |
|
| 53 |
---
|
| 54 |
|
| 55 |
## β‘ Demo
|
| 56 |
|
| 57 |
-
|
| 58 |
-
|
| 59 |
-
---
|
| 60 |
-
|
| 61 |
-
## π Requirements
|
| 62 |
-
|
| 63 |
-
- A valid **Hugging Face API Token** is required.
|
| 64 |
-
- Ensure you set your token as an environment variable `HF_TOKEN` if running locally.
|
| 65 |
|
| 66 |
-
|
| 67 |
|
| 68 |
-
|
| 69 |
-
|
| 70 |
-
## π References
|
| 71 |
-
|
| 72 |
-
Check out the Hugging Face configuration reference for Spaces: [Spaces Config Reference](https://huggingface.co/docs/hub/spaces-config-reference)
|
| 73 |
|
| 74 |
---
|
| 75 |
|
| 76 |
-
##
|
| 77 |
-
|
| 78 |
-
1. Click the webcam feed in the CV MCP Client: https://huggingface.co/spaces/MCP-1st-Birthday/CV_MCP_Client
|
| 79 |
-
2. The Space will display real-time outputs in the provided textboxes:
|
| 80 |
-
- Description
|
| 81 |
-
- Environment
|
| 82 |
-
- Indoor/Outdoor
|
| 83 |
-
- Lighting Condition
|
| 84 |
-
- Human Detected
|
| 85 |
-
- Animals Detected
|
| 86 |
-
- Objects Detected
|
| 87 |
-
- Hazards Identified
|
| 88 |
-
|
| 89 |
-
---
|
| 90 |
|
| 91 |
-
|
|
|
|
| 92 |
|
| 93 |
-
|
|
|
|
|
|
| 8 |
app_file: app.py
|
| 9 |
pinned: false
|
| 10 |
license: mit
|
| 11 |
+
short_description: Real-time CV VLM MCP Server for MCP 1st Birthday Hackathon
|
| 12 |
tags:
|
| 13 |
- building-mcp-track-creative
|
| 14 |
---
|
| 15 |
|
| 16 |
+
# π₯ Robot Vision MCP Server
|
| 17 |
|
| 18 |
+
A **Model Context Protocol (MCP) server** that provides **real-time scene analysis** for webcam images.
|
| 19 |
+
This Space allows users to stream live video feeds and get detailed insights about the environment, objects, humans, and more.
|
| 20 |
|
| 21 |
+
Check out the Hackathon details [here](https://huggingface.co/MCP-1st-Birthday).
|
| 22 |
+
Join the community discussion on [Discord](https://discord.com/channels/879548962464493619/).
|
| 23 |
|
| 24 |
+
π¬ Watch a demo of the CV MCP Server analyzing a robotβs camera feed:
|
| 25 |
+
[Demo Video](https://photos.app.goo.gl/guxui1EsdPNoL4mw7)
|
| 26 |
|
| 27 |
---
|
| 28 |
|
| 29 |
+
## π Key Features
|
| 30 |
|
| 31 |
+
- Real-time scene description
|
| 32 |
+
- Human detection
|
| 33 |
+
- Animal and object detection
|
| 34 |
+
- Environment classification (indoor/outdoor)
|
| 35 |
+
- Lighting condition analysis
|
| 36 |
+
- Hazards identification
|
| 37 |
+
- Optimized for **context window efficiency**
|
| 38 |
|
| 39 |
---
|
| 40 |
|
| 41 |
## π§ How It Works
|
| 42 |
|
| 43 |
+
This Space leverages the MCP server to analyze images captured from your webcam:
|
| 44 |
|
| 45 |
+
1. Stream an image from your webcam.
|
| 46 |
+
2. The image is sent to the **MCP server**.
|
| 47 |
+
3. The server processes it using the **Model Context Protocol**.
|
| 48 |
+
4. Outputs are returned and displayed in the UI, including:
|
| 49 |
+
- General scene description
|
| 50 |
+
- Detected humans
|
| 51 |
+
- Detected animals and objects
|
| 52 |
+
- Environment type (indoor/outdoor)
|
| 53 |
+
- Lighting condition
|
| 54 |
+
- Hazards
|
| 55 |
|
| 56 |
---
|
| 57 |
|
| 58 |
## β‘ Demo
|
| 59 |
|
| 60 |
+
Compatible with **PC, mobile, and robots with cameras**.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 61 |
|
| 62 |
+
Stream images via your webcam or phone camera to receive real-time scene analysis.
|
| 63 |
|
| 64 |
+
Watch a demo video of the CV MCP Server analyzing the video feed from my robot:
|
| 65 |
+
[Demo Video](https://photos.app.goo.gl/guxui1EsdPNoL4mw7)
|
|
|
|
|
|
|
|
|
|
| 66 |
|
| 67 |
---
|
| 68 |
|
| 69 |
+
## π Requirements
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 70 |
|
| 71 |
+
- A valid **Hugging Face API Token** is required.
|
| 72 |
+
- If running locally, set your token as an environment variable:
|
| 73 |
|
| 74 |
+
```bash
|
| 75 |
+
export HF_TOKEN=your_huggingface_token
|