Spaces:

Agents-MCP-Hackathon
/

Min-StyleMatch-AI

Sleeping

App Files Files Community

Min-StyleMatch-AI / README.md

minkhant96

Update README.md

00148db verified 9 months ago

preview code

raw

history blame contribute delete

7.52 kB

	---
	title: Min StyleMatch AI
	emoji: 📉
	colorFrom: gray
	colorTo: green
	sdk: gradio
	sdk_version: 5.33.1
	app_file: app.py
	pinned: false
	license: apache-2.0
	short_description: LLM-Powered Styling & Color Analysis via MCP Server + Gradio
	tags:
	- Agents-MCP-Hackathon
	- agent-demo-track
	- mcp-server-track
	---

	# Project Name: Min-StyleMatch AI – LLM-Powered Styling & Color Analysis via MCP Server + Gradio Client

	---
	## 🎬 Demo Video

	[![Watch the Demo on YouTube](https://img.youtube.com/vi/VvRdsgGUVJQ/0.jpg)](https://www.youtube.com/watch?v=VvRdsgGUVJQ)

	---
	## 📚 Table of Contents

	- [🧠 Project Overview](#-project-overview)
	- [🔧 Tools Integrated](#-tools-integrated)
	- [⚠️ Project Status](#️-note-this-project-is-not-production-ready)
	- [🔍 Models Used](#-models-used)
	- [📦 Open-Source Models (via Nebius)](#-open-source-models-via-nebius)
	- [🔐 Closed-Source Models (via OpenAI)](#-closed-source-models-via-openai)
	- [🧩 System Architecture Overview](#-system-architecture-overview)
	- [🚀 Quickstart](#-quickstart)
	- [🔐 Your Data & Privacy](#-your-data--privacy)
	- [🧠 Chat History](#-chat-history)
	- [🖼️ Image Handling](#️-image-handling)
	- [🎨 Facial Color Data](#-facial-color-data)
	- [✅ Dataset Overview](#-dataset-overview)
	- [🔄 Data Processing Steps](#-data-processing-steps)
	- [📦 Output](#-output)

	---

	## 🧠 Project Overview

	This project uses FastMCP for custom MCP (Model Context Protocol) server and a Gradio Chatbot as an MCP Client, designed to demonstrate how Language Models (LLMs) can interact with tools for advanced reasoning and personalization.

	### 🔧 Tools Integrated

	1. 🧑‍💻 Developer Info Tool

	* Returns information about the developer (you).
	* Purpose: Demonstrates how MCP tool calling supports prompt decomposition, allowing the LLM to delegate parts of the task to improve response accuracy and structure.

	2. 🛍️ Product Filtering Tool

	* Connects the LLM with product data stored in `.csv` files (used as a lightweight database).
	* Purpose: Showcases how LLMs can support product recommendation systems by filtering based on user preferences, color suitability, or product attributes.

	3. 🎨 Color Season Analysis Tool

	* Operates in an Agent-based setup using two LLMs:

	* One extracts facial feature colors (e.g., skin, eyes, hair, lips) from a user-uploaded image.
	* The second uses this data to predict the user’s color season.
	* Purpose: Demonstrates multi-model collaboration, improving accuracy by separating image understanding from season classification.
	(Combining both steps into one model often resulted in inconsistent or random outputs.)

	---

	> ⚠️ Note: This project is not production-ready.
	> It is a demo prototype built to:
	>
	> * Showcase how to integrate product data with LLMs using tool calling.
	> * Demonstrate multi-model collaboration (e.g., using image and text models in tandem).
	> * Illustrate prompt decomposition and how MCP (Model Context Protocol) enables modular reasoning through tool-based workflows.
	>
	> While functional, the current version has non-optimized system flow, basic tool logic, and initial prompt configurations.
	> These areas can be significantly improved for better performance, scalability, and production-readiness.
	>
	> This project is intended as a learning resource and a practical starting point for junior developers, LLM enthusiasts, or teams exploring how to combine AI tools with real-world datasets.
	>
	> With future iterations—such as improved orchestration, tighter prompt design, and scalable backend architecture—it can evolve into a robust, intelligent personalization system.

	---

	## 🔍 Models Used

	### 📦 Open-Source Models (via Nebius)

	* Qwen2.5-VL-72B-Instruct – For visual-linguistic processing.
	* Mistral-Nemo-Instruct-2407 – For general instruction-following tasks.

	### 🔐 Closed-Source Models (via OpenAI)

	* GPT-4o-mini (primary for tool-calling)
	* GPT-4.1-mini

	> ⚠️ Note: Currently, only OpenAI models are used for tool calling, as their API is more mature and familiar.
	> I have not explored whether Nebius supports tool calling.

	---
	## 🧩 System Architecture Overview

	```mermaid
	graph TD
	User["🧑 User"]
	Gradio["🤖 Gradio MCP Client"]
	FastMCP["🛠️ FastMCP Server"]
	Tool1["🧑‍💻 Developer Info Tool"]
	Tool2["🛍️ Product Filter Tool"]
	Tool3["🎨 Color Season Tool"]
	LLM["🧠 LLM (OpenAI / Nebius)"]

	User --> Gradio
	Gradio --> FastMCP
	FastMCP --> Tool1
	FastMCP --> Tool2
	FastMCP --> Tool3
	Tool3 --> LLM
	Tool1 --> LLM
	Tool2 --> LLM
	```

	---
	## 🚀 Quickstart

	1. Clone the repository
	```bash
	git clone https://github.com/yourusername/stylematch-assistant.git
	cd stylematch-assistant
	pip install -r requirements.txt
	```

	---

	## 🔐 Your Data & Privacy

	This project requires the use of the OpenAI API and optionally the Nebius AI Studio API. All API keys are encrypted and never exposed in plain text.

	This is a demo project, and for your safety and control, I recommend cloning the repository and running it locally on your machine.

	### 🧠 Chat History

	* Your conversation is not stored permanently.
	* Chat history will be cleared if you refresh the browser or close the session.

	### 🖼️ Image Handling

	* When you upload a face image, it is stored temporarily for analysis.
	* If you're inactive or stop interacting with the assistant, the image will be automatically deleted.
	* No facial image is permanently saved.

	### 🎨 Facial Color Data

	* The extracted skin, hair, eyes, lips colors, and your color season result are handled just like your image—temporary and private.
	* This information is only used to enhance your personalized experience and is not retained after your session ends.

	---

	## ✅ Dataset Overview
	This project uses the [Fashion Product Images Dataset](https://www.kaggle.com/datasets/paramaggarwal/fashion-product-images-dataset) from Kaggle, which contains metadata and image filenames for over 44,000 fashion products.

	The dataset consists of two key files:

	* `styles.csv` – contains product metadata such as category, gender, and usage.
	* `images.csv` – contains filenames of product images, linked to the product via the `id` column.


	### 🔄 Data Processing Steps

	The function `load_combined_dataset(styles_path, images_path)` performs the following operations:

	1. Load CSV Files

	* Reads `styles.csv` and `images.csv`.
	* Skips malformed lines automatically to avoid read errors.

	2. Prepare for Merge

	* Cleans the `filename` column in `images.csv` by stripping `.jpg` and converting it into integer `id` for merging.

	3. Data Cleaning

	* Drops all rows with missing (`NaN`) values from both DataFrames.
	* Merges the two DataFrames on the `id` column using an inner join.
	* Ensures the resulting combined dataset has no null values.

	4. Filter Relevant Categories

	* Since this is a demo, I keeps only the products that belong to Apparel or Footwear categories (ignores others such as Accessories).

	### 📦 Output

	The function returns a cleaned and filtered `pandas.DataFrame` containing only products ready for personalized styling and analysis.

	---