Spaces:
Sleeping
A newer version of the Gradio SDK is available: 6.15.1
title: Multimodal Chatbot
emoji: π
colorFrom: green
colorTo: red
sdk: gradio
sdk_version: 6.5.1
app_file: app.py
pinned: false
Build and Deploy a Multimodal Chatbot
Table of Contents
Overview
This is a multimodal chatbot application that leverages advanced image analysis and computer vision capabilities. It accepts both text and image inputs, analyzing visual content in detail to understand composition, objects, and contextual elements. The chatbot processes uploaded images through state-of-the-art AI models, providing specific descriptions and insights that can be used across machine learning, computer vision, and multimodal AI applications. Perfect for tasks requiring visual understanding combined with conversational AI.
Project Structure
web-search/
βββ app.py
βββ requirements.txt
βββ README.md
βββ .gitignore
βββ .gitattributes
βββ .env
βββ research/
β βββ notebook.ipynb
βββ src/
βββ __init__.py
βββ config/
β βββ __init__.py
β βββ settings.py
βββ utils/
βββ __init__.py
βββ helpers.py
Setup
Clone from GitHub
git clone https://github.com/elsayedelmandoh/multimodal-chatbot
cd multimodal-chatbot
Conda environment
# create & activate
conda create -n multimodal-chatbot python=3.12 -y
conda activate multimodal-chatbot
# install pip then dependencies
conda install pip -y
pip install -r requirements.txt
Create a .env file at the project root or store vars in the Hugging Face Space secrets.
Run locally
python app.py
Open the URL printed in the terminal.
Duplicate Hugging Face Space
Hugging Face Space URL used in this project: https://huggingface.co/spaces/elsayedelmandoh/multimodal-chatbot
- Sign in to Hugging Face.
- Go to the Space you want to duplicate.
- Click the "Duplicate this Space" button.
- Choose a new name and visibility, then Duplicate Space.
- In the new Space settings add secrets (GEMINI_API_KEY) and push code.
Contributing
- Fork the repository.
- Create a branch for your change.
- Make changes, commit with clear messages.
- Push to your fork and open a pull request.
Author
Developed by Elsayed Elmandoh β NLP Engineer.
LinkedIn: https://linkedin.com/in/elsayed-elmandoh-b5849a1b8/
X/Twitter: https://x.com/aangpy