---
title: PupilSense
emoji: 👁️
colorFrom: red
colorTo: pink
sdk: gradio
sdk_version: 5.38.2
app_file: app.py
pinned: false
---

πŸ‘οΈ PupilSense πŸ‘οΈπŸ•΅οΈβ€β™‚οΈ

PupilSense is a deep learning-powered application for estimating pupil diameter from images and videos. It uses trained ResNet models with Class Activation Mapping (CAM) for interpretable predictions.

## Features

- **Image Processing**: Upload images to get instant pupil diameter estimates
- **Video Processing**: Analyze videos frame by frame for temporal pupil diameter analysis
- **Model Selection**: Choose between ResNet18 and ResNet50 architectures
- **Pupil Selection**: Analyze the left pupil, the right pupil, or both
- **Blink Detection**: Automatically detect and handle blinks in the analysis
- **CAM Visualization**: See which parts of the eye the model focuses on for its predictions
- **API Access**: Full Gradio API support for programmatic access

## Usage

### Web Interface

Simply upload an image or video file and configure your analysis parameters:

- Select the pupil(s) to analyze (left, right, or both)
- Choose the model architecture (ResNet18 or ResNet50)
- Enable or disable blink detection
- Click **Process** to get results
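The repository's blink-detection internals aren't documented here; a common technique for flagging blink frames is the eye aspect ratio (EAR) computed over six eye landmarks. The sketch below is an illustration of that general approach, not necessarily what PupilSense implements:

```python
import math

def eye_aspect_ratio(landmarks):
    """EAR from six (x, y) eye landmarks p1..p6, in the classic ordering:
    EAR = (|p2-p6| + |p3-p5|) / (2 * |p1-p4|).
    Values drop toward zero as the eyelid closes."""
    def dist(a, b):
        return math.hypot(a[0] - b[0], a[1] - b[1])
    p1, p2, p3, p4, p5, p6 = landmarks
    return (dist(p2, p6) + dist(p3, p5)) / (2.0 * dist(p1, p4))

def is_blink(landmarks, threshold=0.2):
    # A frame is flagged as a blink when EAR falls below a
    # threshold; ~0.2 is a commonly used starting point.
    return eye_aspect_ratio(landmarks) < threshold
```

Frames flagged this way can then be excluded from the diameter estimates rather than producing spurious readings.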

### API Access

The Gradio interface provides automatic API endpoints. You can access the API documentation at `/docs` when the app is running.

Example API usage:

```python
import requests

# Image processing: field names match the Gradio inputs described above
with open("your_image.jpg", "rb") as f:
    response = requests.post(
        "https://your-space-url/api/predict",
        files={"image_input": f},
        data={
            "pupil_selection": "both",
            "tv_model": "ResNet18",
            "blink_detection": True,
        },
    )
print(response.json())
```
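The accepted values for each field (per the options listed above) can be validated before sending a request. A small helper sketch follows; the field names mirror the example and are an assumption about the Space's API, not a verified schema:

```python
# Option sets taken from the Usage and Model Information sections.
VALID_PUPILS = {"left", "right", "both"}
VALID_MODELS = {"ResNet18", "ResNet50"}

def build_form_data(pupil_selection="both", tv_model="ResNet18", blink_detection=True):
    """Assemble the form fields used in the request above, rejecting
    values the interface does not offer."""
    if pupil_selection not in VALID_PUPILS:
        raise ValueError(f"pupil_selection must be one of {sorted(VALID_PUPILS)}")
    if tv_model not in VALID_MODELS:
        raise ValueError(f"tv_model must be one of {sorted(VALID_MODELS)}")
    return {
        "pupil_selection": pupil_selection,
        "tv_model": tv_model,
        "blink_detection": blink_detection,
    }
```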

## Model Information

The application uses pre-trained ResNet models specifically trained for pupil diameter estimation:

- **ResNet18**: Faster inference, good accuracy
- **ResNet50**: Higher accuracy, slower inference

Both models support:

- Input resolution: 32×64 pixels (eye region)
- Output: Pupil diameter in millimeters
- CAM visualization for model interpretability

## Technical Details

- **Face Detection**: MediaPipe for robust face and eye detection
- **Preprocessing**: Automatic eye-region extraction and normalization
- **Deep Learning**: PyTorch-based ResNet models
- **Visualization**: Matplotlib for result plotting and CAM overlays
- **Video Support**: Frame-by-frame analysis with temporal plotting
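For video input, the per-frame diameter series usually needs light post-processing before temporal plotting, since blink frames leave gaps. The sketch below is a generic smoothing pass, not the repository's own plotting code; it treats `None` as a skipped (e.g. blink) frame:

```python
def smooth_diameters(series, window=5):
    """Centered moving average over a per-frame diameter series (mm).
    `None` entries are skipped; a window with no valid samples yields None."""
    half = window // 2
    out = []
    for i in range(len(series)):
        vals = [v for v in series[max(0, i - half):i + half + 1] if v is not None]
        out.append(sum(vals) / len(vals) if vals else None)
    return out
```

A smoothed series like this plots more cleanly against frame index (or timestamp) than raw per-frame estimates.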

## Installation & Setup

### Local Development

1. Clone the repository:
   ```bash
   git clone <repository-url>
   cd pupilsense
   ```
2. Create and activate a virtual environment:
   ```bash
   python3 -m venv venv
   source venv/bin/activate  # On Windows: venv\Scripts\activate
   ```
3. Install dependencies:
   ```bash
   pip install -r requirements.txt
   ```
4. Run the application:
   ```bash
   python app.py
   ```

The app will be available at `http://localhost:7860`.

### Hugging Face Spaces Deployment

1. Create a new Space on Hugging Face with the Gradio SDK
2. Upload all files from the `pupilsense` directory
3. Ensure the following files are present:
   - `app.py` (main application file)
   - `gradio_app.py` (Gradio interface)
   - `gradio_utils.py` (utility functions)
   - `requirements.txt` (dependencies)
   - `README.md` (this file, with the YAML header above)
   - `pre_trained_models/` (model files)
   - All other supporting files

## Known Issues & Troubleshooting

### MediaPipe Issues

- **Issue**: Segmentation faults or MediaPipe errors in headless environments
- **Solution**: The app includes error handling for MediaPipe failures. In production environments, ensure proper GPU/display drivers are available.

### Model Loading

- **Issue**: Model files not found
- **Solution**: Ensure the `pre_trained_models/` directory contains the required `.pt` files for both the ResNet18 and ResNet50 models.
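A quick way to catch this at startup is to check for the expected paths. The path list below is taken from the File Structure section; the helper itself is our sketch, not part of the repository:

```python
from pathlib import Path

# Expected layout, per the File Structure section.
REQUIRED_MODELS = [
    "pre_trained_models/ResNet18/left_eye.pt",
    "pre_trained_models/ResNet18/right_eye.pt",
    "pre_trained_models/ResNet50/left_eye.pt",
    "pre_trained_models/ResNet50/right_eye.pt",
]

def missing_model_files(root="."):
    """Return the relative paths of any required .pt files that are absent."""
    root = Path(root)
    return [p for p in REQUIRED_MODELS if not (root / p).is_file()]
```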

### Memory Usage

- **Issue**: High memory usage with large videos
- **Solution**: The app automatically resizes frames to 640×480 to manage memory usage.

## File Structure

```
pupilsense/
├── app.py                 # Main application entry point
├── gradio_app.py          # Gradio interface definition
├── gradio_utils.py        # Utility functions (MediaPipe-free)
├── app_utils.py           # Original Streamlit utilities (legacy)
├── requirements.txt       # Python dependencies
├── README.md              # This file
├── config.yml             # Configuration file
├── registry.py            # Model registry
├── registry_utils.py      # Registry utilities
├── utils.py               # General utilities
├── pre_trained_models/    # Trained model files
│   ├── ResNet18/
│   │   ├── left_eye.pt
│   │   └── right_eye.pt
│   └── ResNet50/
│       ├── left_eye.pt
│       └── right_eye.pt
├── preprocessing/         # Data preprocessing modules
├── feature_extraction/    # Feature extraction modules
├── registrations/         # Model registration modules
└── sample_videos/         # Sample video files
```

## Contributing

  1. Fork the repository
  2. Create a feature branch
  3. Make your changes
  4. Test thoroughly
  5. Submit a pull request

## License

See the `LICENSE` file for details.


Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference