Commit 812e4ec by palubad · verified · 1 Parent(s): 4606005

Update README.md

  1. README.md +36 -69
README.md CHANGED
@@ -10,7 +10,6 @@ The best-performing models were Random Forest Regressor (RFR) for LAI and FAPAR
  These models are part of the paper Paluba, D., Le Saux, B., Sarti, F., Štych, P. (2025): Estimating vegetation indices and biophysical parameters for Central European temperate forests with Sentinel-1 SAR data and machine learning. Published in Big Earth Data
 
  ## Model Details
-
  ### Model Description
 
  The study explores the feasibility of using SAR-based features in combination with additional datasets (e.g., DEM-based features and meteorological data) to estimate optical VIs, specifically, LAI, FAPAR, EVI and NDVI. Traditional optical remote sensing methods are often hindered by cloud cover, making it difficult to obtain continuous and reliable vegetation monitoring data. This research addresses this challenge by applying SAR data, which is unaffected by atmospheric conditions.
@@ -18,8 +17,7 @@ Using ML, particularly RFR and XGB, the study demonstrates that SAR-based VIs ca
 
 
  - **Developed by:** Daniel Paluba, Bertrand Le Saux
- - **Funded by [optional]:** [More Information Needed]
- - **Model type:** [More Information Needed]
  - **License:** CC BY 4.0
 
  ### Model Sources
@@ -53,72 +51,59 @@ Paluba et al. 2025: Estimating vegetation indices and biophysical parameters for
  ## How to Get Started with the Model
 
  To implement this model:
- - Prepare input datasets using the MMTS-GEE tool: Collect Sentinel-1 SAR data, DEM-based features, and meteorological variables.
- - Preprocess data: Use the MMTS-GEE tool for temporal and spatial alignment.
- - Train the model: Implement RFR for LAI/FAPAR and XGB for EVI/NDVI using optimized hyperparameters.
- - Evaluate performance: Compare model outputs with Sentinel-2-based VIs to validate accuracy.
  - Deploy for inference: Apply trained models to monitor vegetation indices in new regions or for near real-time applications.
 
- [More Information Needed]
 
  ## Training Details
 
  ### Training Data
 
- The model was trained on:
- - Sentinel-1 SAR time series (VH and VV polarizations).
- - Sentinel-2 optical vegetation indices (LAI, FAPAR, EVI, NDVI) as ground truth.
- - Digital Elevation Model (DEM)-based features (elevation, slope, LIA).
- - Meteorological variables (temperature, precipitation).
- - Forest type maps (broad-leaved vs. coniferous).
- - Geographic scope: Czechia for training, validated on Central European forests.
 
  ### Training Procedure
 
- Feature Selection: Using permutation feature importance analysis to identify key predictors.
- Data Splitting: Training and validation sets created with a balanced representation of healthy and disturbed forests.
- Hyperparameter Optimization:
- RFR: Fine-tuned for maximum depth, number of trees, and minimum samples per split.
- XGB: Optimized learning rate, tree depth, and number of boosting rounds.
- Model Training: Using scikit-learn and XGBoost libraries with MAE loss function.
- Computational Requirements:
- XGB: Faster training with built-in early stopping (~30-70x faster than RFR).
- RFR: Slower but slightly better performance for LAI/FAPAR.
-
-
- #### Training Hyperparameters
 
- - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
 
- #### Speeds, Sizes, Times [optional]
 
- <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
 
- [More Information Needed]
 
- ## Evaluation
 
- Mean Absolute Error (MAE): Primary metric for accuracy.
- Score: To assess correlation with Sentinel-2 VIs.
- Transferability Test: Applied to different Central European forests.
 
  ### Results
 
  Best models:
- RFR performed best for LAI (MAE ~0.06) and FAPAR.
- XGB performed best for EVI and NDVI.
- SAR-based VIs successfully replicated optical VIs, with clear seasonal and forest-type differentiation.
- Higher MAEs observed in NDVI estimation (~0.48), attributed to forest type inaccuracies and change detection errors.
- SAR-based VIs detected forest changes up to 4 days earlier than Sentinel-2 VIs, significantly improving change detection capabilities.
- Adding DEM and meteorological features improved R² by 3-4%.
-
-
- #### Summary
-
-
- ### Used computation infrastructure
-
- 12th Gen Intel(R) Core(TM) i7-12700 with 2.10 GHz, 64 Gigabyte of RAM and 20 CPU cores.
 
 
  ## Citation [optional]
@@ -127,26 +112,8 @@ Adding DEM and meteorological features improved R² by 3-4%.
 
  **BibTeX:**
 
- [More Information Needed]
 
  **APA:**
 
- [More Information Needed]
-
- ## Glossary [optional]
-
- <!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
-
- [More Information Needed]
-
- ## More Information [optional]
-
- [More Information Needed]
-
- ## Model Card Authors [optional]
-
- [More Information Needed]
-
- ## Model Card Contact
-
- [More Information Needed]
 
  These models are part of the paper Paluba, D., Le Saux, B., Sarti, F., Štych, P. (2025): Estimating vegetation indices and biophysical parameters for Central European temperate forests with Sentinel-1 SAR data and machine learning. Published in Big Earth Data
 
  ## Model Details
  ### Model Description
 
  The study explores the feasibility of using SAR-based features in combination with additional datasets (e.g., DEM-based features and meteorological data) to estimate optical VIs, specifically, LAI, FAPAR, EVI and NDVI. Traditional optical remote sensing methods are often hindered by cloud cover, making it difficult to obtain continuous and reliable vegetation monitoring data. This research addresses this challenge by applying SAR data, which is unaffected by atmospheric conditions.

 
 
  - **Developed by:** Daniel Paluba, Bertrand Le Saux
+ - **Funded by:** Charles University Grant Agency – Grantová Agentura Univerzity Karlovy (GAUK) Grant No. 412722; the European Union’s Caroline Herschel Framework Partnership Agreement on Copernicus User Uptake under grant agreement No. FPA 275/G/GRO/COPE/17/10042, project FPCUP (Framework Partnership Agreement on Copernicus User Uptake); and the Spatial Data Analyst project (NPO_UK_MSMT-16602/2022) funded by the European Union – NextGenerationEU.
  - **License:** CC BY 4.0
 
  ### Model Sources
 
  ## How to Get Started with the Model
 
  To implement this model:
+ - Prepare input datasets using the [MMTS-GEE tool](https://github.com/palubad/MMTS-GEE): Collect Sentinel-1 SAR data, DEM-based features, and meteorological variables.
+ - The models were trained using the following input features:
+   - SAR features: VV, VH, incidence angle (angle), VV/VH, VH/VV
+   - DEM-based features: Local Incidence Angle (LIA), elevation, and slope
+   - Meteorological features: precipitation sum over the 12 hours before SAR acquisition (prec.12h) and temperature at the time of SAR acquisition
+   - Land cover category: forest type, distinguishing coniferous from broad-leaved forests
+   - Temporal features: DOYsin and DOYcos, which encode the day of year of the corresponding SAR acquisition
  - Deploy for inference: Apply trained models to monitor vegetation indices in new regions or for near real-time applications.
 
+ **Demo code will be provided soon.**
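
Until the official demo code is available, the input feature list above can be illustrated with a minimal sketch. It is not the authors' code: the function name `build_features`, the dictionary keys other than `angle`, `prec.12h`, `DOYsin` and `DOYcos`, the 0/1 forest-type encoding, the plain-quotient ratios, and the 365-day period for the cyclic day-of-year encoding are all assumptions made for illustration.

```python
import math


def build_features(vv, vh, angle, lia, elevation, slope,
                   prec_12h, temperature, forest_type, doy):
    """Assemble one feature row (hypothetical helper, not from the paper)."""
    # Assumption: day of year is mapped onto a 365-day circle so that
    # 31 December and 1 January end up close to each other.
    phase = 2.0 * math.pi * doy / 365.0
    return {
        "VV": vv,
        "VH": vh,
        "angle": angle,
        # Assumption: ratios are plain quotients of the supplied values.
        "VV/VH": vv / vh,
        "VH/VV": vh / vv,
        "LIA": lia,
        "elevation": elevation,
        "slope": slope,
        "prec.12h": prec_12h,
        "temperature": temperature,
        # Assumption: 0 = coniferous, 1 = broad-leaved.
        "forest_type": forest_type,
        "DOYsin": math.sin(phase),
        "DOYcos": math.cos(phase),
    }


# Made-up example values for a single SAR acquisition.
row = build_features(vv=-9.5, vh=-15.2, angle=38.0, lia=35.0,
                     elevation=520.0, slope=7.5, prec_12h=0.4,
                     temperature=12.3, forest_type=1, doy=182)
print(row)
```

The sine and cosine pair is a common way to give tree-based models a continuous notion of season; the feature order and scaling actually expected by the released models should be checked against the MMTS-GEE outputs.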
 
  ## Training Details
 
  ### Training Data
 
+ The training data is available from the [SAR-based-VIs GitHub repository](https://github.com/palubad/SAR-based-VIs).
 
  ### Training Procedure
 
+ - Feature Selection: Permutation feature importance analysis was used to identify key predictors.
+ - Data Splitting: Training and validation sets were created with a balanced representation of healthy and disturbed forests.
+ - Hyperparameter Optimization:
+   - RFR: Fine-tuned for maximum depth, number of trees, and minimum samples per split.
+   - XGB: Optimized learning rate, tree depth, and number of boosting rounds.
+ - Model Training: The scikit-learn and XGBoost libraries were used with an MAE loss function.
+ - Computational Requirements:
+   - XGB: Faster training with built-in early stopping (~30-70x faster than RFR).
+   - RFR: Slower but slightly better performance for LAI and FAPAR.
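
The procedure above can be sketched for the RFR case with scikit-learn. This is a rough illustration on synthetic data, not the authors' pipeline: the data, the grid values, the 3-fold cross-validation, and the variable names are all assumptions. MAE is used here as the selection and scoring metric; the exact way MAE entered the original training is described in the paper, not here.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.inspection import permutation_importance
from sklearn.model_selection import GridSearchCV, train_test_split

# Synthetic stand-in data: 300 samples, 4 placeholder features.
rng = np.random.default_rng(0)
X = rng.normal(size=(300, 4))
y = 2.0 * X[:, 0] + 0.5 * X[:, 1] + rng.normal(scale=0.1, size=300)

X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.25, random_state=0)

# Small illustrative grid over the three RFR settings named above.
param_grid = {
    "n_estimators": [50, 100],
    "max_depth": [5, None],
    "min_samples_split": [2, 5],
}
search = GridSearchCV(
    RandomForestRegressor(random_state=0),
    param_grid,
    scoring="neg_mean_absolute_error",
    cv=3,
)
search.fit(X_train, y_train)

# Permutation importance on held-out data, scored by MAE.
result = permutation_importance(
    search.best_estimator_, X_val, y_val,
    scoring="neg_mean_absolute_error", n_repeats=5, random_state=0)
ranking = np.argsort(result.importances_mean)[::-1]

print("best params:", search.best_params_)
print("features ranked by importance:", ranking.tolist())
```

An XGB variant would follow the same shape with `xgboost.XGBRegressor` and a grid over learning rate, tree depth, and number of boosting rounds.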
 
 
 
 
+ #### Used computation infrastructure
 
+ 12th Gen Intel(R) Core(TM) i7-12700 at 2.10 GHz, 64 GB of RAM, and 20 CPU cores.
 
+ #### Training Hyperparameters
 
+ For details on hyperparameter optimization, model performance, and training speed, see Paluba et al. (2025).
 
+ ## Evaluation metrics
 
+ - Mean Absolute Error (MAE): Primary metric for accuracy.
+ - Mean Squared Error (MSE): Secondary metric for accuracy.
+ - R² Score: To assess correlation with Sentinel-2 VIs.
+ - Transferability Test: Applied to different Central European forests.
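
For reference, the three numeric metrics listed above can be computed as in the following sketch. The helper names and the sample values are made up for illustration and are not results from the paper; R² is computed in its usual coefficient-of-determination form.

```python
def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def mse(y_true, y_pred):
    """Mean squared error."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Made-up values: Sentinel-2 reference NDVI vs. SAR-based estimates.
s2_ndvi = [0.80, 0.75, 0.60, 0.85]
sar_ndvi = [0.78, 0.70, 0.65, 0.83]

print("MAE:", mae(s2_ndvi, sar_ndvi))
print("MSE:", mse(s2_ndvi, sar_ndvi))
print("R2:", r2(s2_ndvi, sar_ndvi))
```

In practice `sklearn.metrics.mean_absolute_error`, `mean_squared_error`, and `r2_score` give the same quantities.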
 
  ### Results
 
  Best models:
+ - RFR performed best for LAI (MAE ~0.06) and FAPAR.
+ - XGB performed best for EVI and NDVI.
+ - SAR-based VIs successfully replicated optical VIs, with clear seasonal and forest-type differentiation.
+ - Higher MAEs were observed in NDVI estimation (~0.48), attributed to forest type inaccuracies and change detection errors.
+ - SAR-based VIs detected forest changes up to 4 days earlier than Sentinel-2 VIs, significantly improving change detection capabilities.
+ - Adding DEM and meteorological features improved R² by 3-4%.
 
 
  ## Citation [optional]
 
 
  **BibTeX:**
 
+ Will be added soon.
 
  **APA:**
 
+ Will be added soon.