KPLabs's picture
Update README.md
3e10a2c verified
---
base_model:
- ibm-esa-geospatial/TerraMind-1.0-base
pipeline_tag: image-classification
tags:
- methane
- detection
- geospatial
- terramind
---
# FAST-EO Use Case 2 - Methane Detection
This repository contains data and code to reproduce experiments for fine-tuning **TerraMind-Base** to detect methane-related signatures in multispectral imagery. It includes multiple experiment variants and their corresponding datasets.
The file `Methane_benchmark_patches_summary_v3.xlsx` provides per-patch descriptions and defines the **fold splits** used to ensure non-overlapping partitions. Runner scripts use this Excel file to build train/val/test splits, typically reserving one fold for testing.
All scripts provide usage instructions via `--help` (or `-h`).
## Requirements
- Install the TerraMind/Terratorch stack used by the project before running experiments.
- Ensure your environment has the required dependencies for data loading and training.
## Experiments
### Experiment 1: Fine-tuning on Methane Benchmark Dataset (MBD)
Fine-tune TerraMind-Base on the Methane Benchmark Dataset. The normalized dataset is provided in:
- `MBD_nan_S2_zscore/`
Training code is located in:
- `classification/` (includes dataset and dataloader classes)
### Experiment 2: Fine-tuning on MBD with text captions
This experiment modifies the TerraMind-Base model to concatenate text-caption embeddings with visual embeddings.
- Caption embeddings are computed with `all-MiniLM-L6-v2`.
- Code and resources are in `classification_with_text/`.
- Original captions: `classification_with_text/MBD_text/`
- Precomputed embeddings: `combined_caption_embeddings.csv`
### Experiment 3: Sentinel-2 with simulated atmospheric conditions
Evaluate generalization on Sentinel-2 data with simulated atmospheric conditions in both:
- Top-of-Atmosphere (TOA)
- Bottom-of-Atmosphere (BOA)
The model can be trained on this simulated data or used only for inference to test cross-domain robustness.
### Experiment 4: Intuition-1 with simulated atmospheric conditions
Analogous to Experiment 3, but using Intuition-1 imagery with simulated atmospheric conditions (TOA and BOA). This experiment tests robustness under domain shift.
### Experiment 5: Urban dataset without methane (false-positive stress test)
A control dataset containing only urban imagery without methane is used to check whether models learn methane-specific cues rather than urban signatures. The goal is to quantify false positives.
Scripts for loading and running inference on this dataset are provided in the repository.
## Notes
- Use `--help` on each runner/training script to see available options.
- Keep fold definitions consistent with `Methane_benchmark_patches_summary_v3.xlsx` to ensure comparable results across experiments.