multimodal-chatbot / README.md
elsayedelmandoh's picture
upload project
a1660ff

A newer version of the Gradio SDK is available: 6.15.1

Upgrade
metadata
title: Multimodal Chatbot
emoji: πŸ†
colorFrom: green
colorTo: red
sdk: gradio
sdk_version: 6.5.1
app_file: app.py
pinned: false

Build and Deploy a Multimodal Chatbot

Table of Contents

Overview

This is a multimodal chatbot application that leverages advanced image analysis and computer vision capabilities. It accepts both text and image inputs, analyzing visual content in detail to understand composition, objects, and contextual elements. The chatbot processes uploaded images through state-of-the-art AI models, providing specific descriptions and insights that can be used across machine learning, computer vision, and multimodal AI applications. Perfect for tasks requiring visual understanding combined with conversational AI.

Project Structure

web-search/
β”œβ”€β”€ app.py
β”œβ”€β”€ requirements.txt
β”œβ”€β”€ README.md
β”œβ”€β”€ .gitignore
β”œβ”€β”€ .gitattributes
β”œβ”€β”€ .env
β”œβ”€β”€ research/
β”‚   └── notebook.ipynb
└── src/
    β”œβ”€β”€ __init__.py
    β”œβ”€β”€ config/
    β”‚   β”œβ”€β”€ __init__.py
    β”‚   └── settings.py
    └── utils/
        β”œβ”€β”€ __init__.py
        └── helpers.py

Setup

  • GitHub repo: link
  • Hugging Face Space: link

Clone from GitHub

git clone https://github.com/elsayedelmandoh/multimodal-chatbot
cd multimodal-chatbot

Conda environment

# create & activate
conda create -n multimodal-chatbot python=3.12 -y
conda activate multimodal-chatbot

# install pip then dependencies
conda install pip -y
pip install -r requirements.txt

Create a .env file at the project root or store vars in the Hugging Face Space secrets.

Run locally

python app.py

Open the URL printed in the terminal.

Duplicate Hugging Face Space

Hugging Face Space URL used in this project: https://huggingface.co/spaces/elsayedelmandoh/multimodal-chatbot

  1. Sign in to Hugging Face.
  2. Go to the Space you want to duplicate.
  3. Click the "Duplicate this Space" button.
  4. Choose a new name and visibility, then Duplicate Space.
  5. In the new Space settings add secrets (GEMINI_API_KEY) and push code.

Contributing

  1. Fork the repository.
  2. Create a branch for your change.
  3. Make changes, commit with clear messages.
  4. Push to your fork and open a pull request.

Author

Developed by Elsayed Elmandoh β€” NLP Engineer.
LinkedIn: https://linkedin.com/in/elsayed-elmandoh-b5849a1b8/
X/Twitter: https://x.com/aangpy