File size: 2,807 Bytes

3daddbd
 
3e10a2c
3daddbd
 
3e10a2c
 
 
 
3daddbd
3e10a2c
97a17c2
 
3e10a2c
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
97a17c2
3e10a2c
97a17c2
3e10a2c
 
 
 
97a17c2
3e10a2c
97a17c2
3e10a2c
 
 
97a17c2
3e10a2c
97a17c2
3e10a2c
97a17c2
3e10a2c
97a17c2
3e10a2c
97a17c2
3e10a2c
97a17c2
3e10a2c
97a17c2
3e10a2c
97a17c2
3e10a2c

---
base_model:
  - ibm-esa-geospatial/TerraMind-1.0-base
pipeline_tag: image-classification
tags:
  - methane
  - detection
  - geospatial
  - terramind
---

# FAST-EO Use Case 2 - Methane Detection

This repository contains data and code to reproduce experiments for fine-tuning **TerraMind-Base** to detect methane-related signatures in multispectral imagery. It includes multiple experiment variants and their corresponding datasets.

The file `Methane_benchmark_patches_summary_v3.xlsx` provides per-patch descriptions and defines the **fold splits** used to ensure non-overlapping partitions. Runner scripts use this Excel file to build train/val/test splits, typically reserving one fold for testing.

All scripts provide usage instructions via `--help` (or `-h`).

## Requirements

- Install the TerraMind/Terratorch stack used by the project before running experiments.
- Ensure your environment has the required dependencies for data loading and training.

## Experiments

### Experiment 1: Fine-tuning on Methane Benchmark Dataset (MBD)

Fine-tune TerraMind-Base on the Methane Benchmark Dataset. The normalized dataset is provided in:

- `MBD_nan_S2_zscore/`

Training code is located in:

- `classification/` (includes dataset and dataloader classes)

### Experiment 2: Fine-tuning on MBD with text captions

This experiment modifies the TerraMind-Base model to concatenate text-caption embeddings with visual embeddings.

- Caption embeddings are computed with `all-MiniLM-L6-v2`.
- Code and resources are in `classification_with_text/`.
- Original captions: `classification_with_text/MBD_text/`
- Precomputed embeddings: `combined_caption_embeddings.csv`

### Experiment 3: Sentinel-2 with simulated atmospheric conditions

Evaluate generalization on Sentinel-2 data with simulated atmospheric conditions in both:
- Top-of-Atmosphere (TOA)
- Bottom-of-Atmosphere (BOA)

The model can be trained on this simulated data or used only for inference to test cross-domain robustness.

### Experiment 4: Intuition-1 with simulated atmospheric conditions

Analogous to Experiment 3, but using Intuition-1 imagery with simulated atmospheric conditions (TOA and BOA). This experiment tests robustness under domain shift.

### Experiment 5: Urban dataset without methane (false-positive stress test)

A control dataset containing only urban imagery without methane is used to check whether models learn methane-specific cues rather than urban signatures. The goal is to quantify false positives.

Scripts for loading and running inference on this dataset are provided in the repository.

## Notes

- Use `--help` on each runner/training script to see available options.
- Keep fold definitions consistent with `Methane_benchmark_patches_summary_v3.xlsx` to ensure comparable results across experiments.