| | --- |
| | base_model: |
| | - ibm-esa-geospatial/TerraMind-1.0-base |
| | pipeline_tag: image-classification |
| | tags: |
| | - methane |
| | - detection |
| | - geospatial |
| | - terramind |
| | --- |
| | |
| | # FAST-EO Use Case 2 - Methane Detection |
| |
|
| | This repository contains data and code to reproduce experiments for fine-tuning **TerraMind-Base** to detect methane-related signatures in multispectral imagery. It includes multiple experiment variants and their corresponding datasets. |
| |
|
| | The file `Methane_benchmark_patches_summary_v3.xlsx` provides per-patch descriptions and defines the **fold splits** used to ensure non-overlapping partitions. Runner scripts use this Excel file to build train/val/test splits, typically reserving one fold for testing. |
| |
|
| | All scripts provide usage instructions via `--help` (or `-h`). |
| |
|
| | ## Requirements |
| |
|
| | - Install the TerraMind/Terratorch stack used by the project before running experiments. |
| | - Ensure your environment has the required dependencies for data loading and training. |
| |
|
| | ## Experiments |
| |
|
| | ### Experiment 1: Fine-tuning on Methane Benchmark Dataset (MBD) |
| |
|
| | Fine-tune TerraMind-Base on the Methane Benchmark Dataset. The normalized dataset is provided in: |
| |
|
| | - `MBD_nan_S2_zscore/` |
| |
|
| | Training code is located in: |
| |
|
| | - `classification/` (includes dataset and dataloader classes) |
| |
|
| | ### Experiment 2: Fine-tuning on MBD with text captions |
| |
|
| | This experiment modifies the TerraMind-Base model to concatenate text-caption embeddings with visual embeddings. |
| |
|
| | - Caption embeddings are computed with `all-MiniLM-L6-v2`. |
| | - Code and resources are in `classification_with_text/`. |
| | - Original captions: `classification_with_text/MBD_text/` |
| | - Precomputed embeddings: `combined_caption_embeddings.csv` |
| |
|
| | ### Experiment 3: Sentinel-2 with simulated atmospheric conditions |
| |
|
| | Evaluate generalization on Sentinel-2 data with simulated atmospheric conditions in both: |
| | - Top-of-Atmosphere (TOA) |
| | - Bottom-of-Atmosphere (BOA) |
| |
|
| | The model can be trained on this simulated data or used only for inference to test cross-domain robustness. |
| |
|
| | ### Experiment 4: Intuition-1 with simulated atmospheric conditions |
| |
|
| | Analogous to Experiment 3, but using Intuition-1 imagery with simulated atmospheric conditions (TOA and BOA). This experiment tests robustness under domain shift. |
| |
|
| | ### Experiment 5: Urban dataset without methane (false-positive stress test) |
| |
|
| | A control dataset containing only urban imagery without methane is used to check whether models learn methane-specific cues rather than urban signatures. The goal is to quantify false positives. |
| |
|
| | Scripts for loading and running inference on this dataset are provided in the repository. |
| |
|
| | ## Notes |
| |
|
| | - Use `--help` on each runner/training script to see available options. |
| | - Keep fold definitions consistent with `Methane_benchmark_patches_summary_v3.xlsx` to ensure comparable results across experiments. |
| |
|