imagebind / README.md
fcastrovilli's picture
feat: imagebind
64e55d6
|
raw
history blame
1.31 kB
metadata
title: Imagebind
emoji: 📚
colorFrom: red
colorTo: green
sdk: docker
pinned: false
license: mit
short_description: Small imagebind api implementation

ImageBind API Implementation

A FastAPI implementation of Facebook's ImageBind model for cross-modal embeddings.

Local Setup

  1. Install system dependencies:
sudo apt-get update && sudo apt-get install -y ffmpeg libsndfile1
  1. Create and activate a virtual environment:
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
  1. Install Python dependencies:
pip install --no-cache-dir --upgrade -r requirements.txt
  1. Download and setup ImageBind:
python setup_imagebind.py
pip install --no-cache-dir .

Docker Setup

Build and run the container:

docker build -t imagebind-api .
docker run -p 8000:8000 imagebind-api

API Endpoints

The API will be available at http://localhost:8000 with the following endpoints:

  • POST /compute_embeddings: Generate embeddings for images, audio files, and text
  • POST /compute_similarities: Compute similarities between embeddings

For detailed API documentation, visit http://localhost:8000/docs

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference