Segmentation / README.md
Edwin Salguero
Initial commit: SAM 2 Few-Shot/Zero-Shot Segmentation Research Framework
12fa055
|
raw
history blame
2.33 kB

SAM 2 Few-Shot/Zero-Shot Segmentation Research

This repository contains research on combining Segment Anything Model 2 (SAM 2) with minimal supervision for domain-specific segmentation tasks.

Research Overview

The goal is to study how SAM 2 can be adapted to new object categories in specific domains (satellite imagery, fashion, robotics) using:

  • Few-shot learning: 1-10 labeled examples per class
  • Zero-shot learning: No labeled examples, using text prompts and visual similarity

Key Research Areas

1. Domain Adaptation

  • Satellite Imagery: Buildings, roads, vegetation, water bodies
  • Fashion: Clothing items, accessories, patterns
  • Robotics: Industrial objects, tools, safety equipment

2. Learning Paradigms

  • Prompt Engineering: Optimizing text prompts for SAM 2
  • Visual Similarity: Using CLIP embeddings for zero-shot transfer
  • Meta-learning: Learning to adapt quickly to new domains

3. Evaluation Metrics

  • IoU (Intersection over Union)
  • Dice Coefficient
  • Boundary Accuracy
  • Domain-specific metrics

Project Structure

β”œβ”€β”€ data/                   # Dataset storage
β”œβ”€β”€ models/                 # Model implementations
β”œβ”€β”€ experiments/           # Experiment configurations
β”œβ”€β”€ utils/                 # Utility functions
β”œβ”€β”€ notebooks/             # Jupyter notebooks for analysis
β”œβ”€β”€ results/               # Experiment results and visualizations
└── requirements.txt       # Dependencies

Quick Start

  1. Install dependencies:

    pip install -r requirements.txt
    
  2. Download SAM 2:

    python scripts/download_sam2.py
    
  3. Run few-shot experiment:

    python experiments/few_shot_satellite.py
    
  4. Run zero-shot experiment:

    python experiments/zero_shot_fashion.py
    

Research Papers

This work builds upon:

Contributing

Please read our contributing guidelines and code of conduct before submitting pull requests.

License

MIT License - see LICENSE file for details.