image-caption-ai / README.md
Ishu8904's picture
Final cleanup and Dockerfile fix
fcf3f32
metadata
title: Image Captioning AI
sdk: docker
app_port: 5000

Image Caption AI

This is an end-to-end AI application that generates descriptive captions for uploaded images.

How it Works

The application is built with a classic Encoder-Decoder architecture:

  • Encoder: A pre-trained ResNet-50 Convolutional Neural Network (CNN) extracts visual features from the image.
  • Decoder: A Long Short-Term Memory (LSTM) Recurrent Neural Network (RNN) with an Attention mechanism generates the caption word by word based on the image features.

Tech Stack

  • Model: PyTorch
  • Web Framework: Flask
  • Deployment: Docker, Hugging Face Spaces