multimodal-chatbot / README.md
elsayedelmandoh's picture
upload project
a1660ff
---
title: Multimodal Chatbot
emoji: πŸ†
colorFrom: green
colorTo: red
sdk: gradio
sdk_version: 6.5.1
app_file: app.py
pinned: false
---
# Build and Deploy a Multimodal Chatbot
## Table of Contents
- [Overview](#overview)
- [Project Structure](#project-structure)
- [Setup](#setup)
- [Clone from GitHub](#clone-from-github)
- [Conda environment](#conda-environment)
- [Run locally](#run-locally)
- [Duplicate Hugging Face Space](#duplicate-hugging-face-space)
- [Contributing](#contributing)
- [Author](#author)
## Overview
This is a multimodal chatbot application that leverages advanced image analysis and computer vision capabilities. It accepts both text and image inputs, analyzing visual content in detail to understand composition, objects, and contextual elements. The chatbot processes uploaded images through state-of-the-art AI models, providing specific descriptions and insights that can be used across machine learning, computer vision, and multimodal AI applications. Perfect for tasks requiring visual understanding combined with conversational AI.
## Project Structure
```
web-search/
β”œβ”€β”€ app.py
β”œβ”€β”€ requirements.txt
β”œβ”€β”€ README.md
β”œβ”€β”€ .gitignore
β”œβ”€β”€ .gitattributes
β”œβ”€β”€ .env
β”œβ”€β”€ research/
β”‚ └── notebook.ipynb
└── src/
β”œβ”€β”€ __init__.py
β”œβ”€β”€ config/
β”‚ β”œβ”€β”€ __init__.py
β”‚ └── settings.py
└── utils/
β”œβ”€β”€ __init__.py
└── helpers.py
```
## Setup
- GitHub repo: [link](https://github.com/elsayedelmandoh/multimodal-chatbot)
- Hugging Face Space: [link](https://huggingface.co/spaces/elsayedelmandoh/multimodal-chatbot)
### Clone from GitHub
```bash
git clone https://github.com/elsayedelmandoh/multimodal-chatbot
cd multimodal-chatbot
```
### Conda environment
```bash
# create & activate
conda create -n multimodal-chatbot python=3.12 -y
conda activate multimodal-chatbot
# install pip then dependencies
conda install pip -y
pip install -r requirements.txt
```
Create a .env file at the project root or store vars in the Hugging Face Space secrets.
### Run locally
```bash
python app.py
```
Open the URL printed in the terminal.
### Duplicate Hugging Face Space
Hugging Face Space URL used in this project:
https://huggingface.co/spaces/elsayedelmandoh/multimodal-chatbot
1. Sign in to Hugging Face.
2. Go to the Space you want to duplicate.
3. Click the "Duplicate this Space" button.
4. Choose a new name and visibility, then Duplicate Space.
5. In the new Space settings add secrets (GEMINI_API_KEY) and push code.
## Contributing
1. Fork the repository.
2. Create a branch for your change.
3. Make changes, commit with clear messages.
4. Push to your fork and open a pull request.
## Author
Developed by Elsayed Elmandoh β€” NLP Engineer.
LinkedIn: https://linkedin.com/in/elsayed-elmandoh-b5849a1b8/
X/Twitter: https://x.com/aangpy