Spaces:
Sleeping
Sleeping
Update README.md
Browse files
README.md
CHANGED
|
@@ -8,88 +8,67 @@ sdk_version: 6.0.1
|
|
| 8 |
app_file: app.py
|
| 9 |
pinned: false
|
| 10 |
license: mit
|
| 11 |
-
short_description:
|
| 12 |
tags:
|
| 13 |
- mcp-in-action-track-creative
|
| 14 |
---
|
| 15 |
|
| 16 |
-
|
| 17 |
|
| 18 |
-
|
| 19 |
-
https://discord.com/channels/879548962464493619/
|
| 20 |
|
| 21 |
-
|
|
|
|
| 22 |
|
| 23 |
-
|
|
|
|
| 24 |
|
| 25 |
---
|
| 26 |
|
| 27 |
-
## π Features
|
| 28 |
|
| 29 |
-
- Real-time scene description
|
| 30 |
-
- Human detection
|
| 31 |
-
-
|
| 32 |
-
- Environment classification (indoor/outdoor)
|
| 33 |
-
- Lighting condition analysis
|
| 34 |
-
- Hazards identification
|
| 35 |
-
- Optimized for context window
|
| 36 |
|
| 37 |
---
|
| 38 |
|
| 39 |
## π§ How It Works
|
| 40 |
|
| 41 |
-
This Gradio Space
|
| 42 |
|
| 43 |
-
1. Capture an image
|
| 44 |
-
2. Convert the image to Base64 format
|
| 45 |
-
3. Send the image along with your Hugging Face token to the MCP server.
|
| 46 |
4. Receive detailed scene analysis, including:
|
| 47 |
-
|
| 48 |
-
|
| 49 |
-
|
| 50 |
-
|
| 51 |
-
|
| 52 |
-
|
| 53 |
|
| 54 |
---
|
| 55 |
|
| 56 |
## β‘ Demo
|
| 57 |
|
| 58 |
-
|
| 59 |
-
Use your webcam/phone camera to stream images and receive real-time scene analysis.
|
| 60 |
|
| 61 |
-
|
| 62 |
-
https://photos.app.goo.gl/guxui1EsdPNoL4mw7
|
| 63 |
|
|
|
|
|
|
|
| 64 |
|
| 65 |
---
|
| 66 |
|
| 67 |
## π Requirements
|
| 68 |
|
| 69 |
-
- **Hugging Face API Token** is required for
|
| 70 |
-
- Set your token as an environment variable
|
| 71 |
-
- Without a valid token, the app will display an error message.
|
| 72 |
|
| 73 |
-
|
| 74 |
-
|
| 75 |
-
---
|
| 76 |
-
|
| 77 |
-
## π Usage
|
| 78 |
-
|
| 79 |
-
1. Open the Space and allow webcam access.
|
| 80 |
-
2. Stream images via the webcam.
|
| 81 |
-
3. View the outputs in the textboxes:
|
| 82 |
-
- Description
|
| 83 |
-
- Environment
|
| 84 |
-
- Indoor/Outdoor
|
| 85 |
-
- Lighting Condition
|
| 86 |
-
- Human Detected
|
| 87 |
-
- Animals Detected
|
| 88 |
-
- Objects Detected
|
| 89 |
-
- Hazards Identified
|
| 90 |
-
|
| 91 |
-
---
|
| 92 |
-
|
| 93 |
-
## β οΈ Note
|
| 94 |
-
|
| 95 |
-
This project demonstrates a client-side MCP integration. The token used in this Space is tied to the deployer's Hugging Face account. To avoid exposing your token to the public, consider using OAuth login so users can use their own Hugging Face tokens.
|
|
|
|
| 8 |
app_file: app.py
|
| 9 |
pinned: false
|
| 10 |
license: mit
|
| 11 |
+
short_description: Computer Vision MCP Client for MCP 1st Birthday Hackathon
|
| 12 |
tags:
|
| 13 |
- mcp-in-action-track-creative
|
| 14 |
---
|
| 15 |
|
| 16 |
+
# π₯ CV MCP Client
|
| 17 |
|
| 18 |
+
A **Computer Vision Model Context Protocol (MCP) Client** that streams webcam images and provides detailed scene analysis in real time, designed for the **MCP 1st Birthday Hackathon**.
|
|
|
|
| 19 |
|
| 20 |
+
Check out the Hackathon details [here](https://huggingface.co/MCP-1st-Birthday).
|
| 21 |
+
Join the community discussion on [Discord](https://discord.com/channels/879548962464493619/).
|
| 22 |
|
| 23 |
+
Demo video of the CV MCP Server analyzing the video feed from my robot:
|
| 24 |
+
[Demo Video](https://photos.app.goo.gl/guxui1EsdPNoL4mw7)
|
| 25 |
|
| 26 |
---
|
| 27 |
|
| 28 |
+
## π Key Features
|
| 29 |
|
| 30 |
+
- Real-time scene description
|
| 31 |
+
- Human detection
|
| 32 |
+
- Animal and object detection
|
| 33 |
+
- Environment classification (indoor/outdoor)
|
| 34 |
+
- Lighting condition analysis
|
| 35 |
+
- Hazards identification
|
| 36 |
+
- Optimized for efficient context window usage
|
| 37 |
|
| 38 |
---
|
| 39 |
|
| 40 |
## π§ How It Works
|
| 41 |
|
| 42 |
+
This Gradio Space interacts with an MCP server to analyze webcam images:
|
| 43 |
|
| 44 |
+
1. Capture an image from your webcam.
|
| 45 |
+
2. Convert the image to **Base64 format**.
|
| 46 |
+
3. Send the image along with your **Hugging Face API token** to the MCP server.
|
| 47 |
4. Receive detailed scene analysis, including:
|
| 48 |
+
- Scene description
|
| 49 |
+
- Detected humans
|
| 50 |
+
- Detected animals and objects
|
| 51 |
+
- Environment type (indoor/outdoor)
|
| 52 |
+
- Lighting condition
|
| 53 |
+
- Hazards
|
| 54 |
|
| 55 |
---
|
| 56 |
|
| 57 |
## β‘ Demo
|
| 58 |
|
| 59 |
+
Compatible with **PC, mobile, and robots with cameras**.
|
|
|
|
| 60 |
|
| 61 |
+
Stream images via your webcam or phone camera to receive real-time scene analysis.
|
|
|
|
| 62 |
|
| 63 |
+
Watch a demo video of the CV MCP Server analyzing the video feed from my robot:
|
| 64 |
+
[Demo Video](https://photos.app.goo.gl/guxui1EsdPNoL4mw7)
|
| 65 |
|
| 66 |
---
|
| 67 |
|
| 68 |
## π Requirements
|
| 69 |
|
| 70 |
+
- **Hugging Face API Token** is required for MCP server access.
|
| 71 |
+
- Set your token as an environment variable:
|
|
|
|
| 72 |
|
| 73 |
+
```bash
|
| 74 |
+
export HF_TOKEN=your_huggingface_token
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|