| --- |
| language: en |
| license: mit |
| tags: |
| - computer-vision |
| - image-segmentation |
| - plant-disease |
| - agricultural-ai |
| - foundation-model |
| - sam |
| - yolo |
| - coffee |
| - rust-disease |
| datasets: |
| - coffee-leaf-rust-severity |
| metrics: |
| - iou |
| - dice |
| - precision |
| - recall |
| - lin-concordance-correlation-coefficient |
| --- |
| |
| Foundation Model–Assisted Coffee Leaf Rust Severity Estimation |
|
|
| This repository accompanies the manuscript: |
|
|
| Foundation model–assisted segmentation enables robust field-based severity estimation of coffee leaf rust |
|
|
| This project presents a fully reproducible computer vision pipeline for quantitative estimation of coffee leaf rust (Hemileia vastatrix) severity under heterogeneous field conditions. The framework integrates object detection, lesion segmentation, pixel-based severity quantification, and concordance analysis grounded in phytopathometry principles. |
|
|
| The study compares classical image processing, supervised deep learning, and foundation segmentation models for lesion detection, and evaluates agreement with gold-standard pixel-level annotations using Lin’s Concordance Correlation Coefficient (LCCC). |
|
|
|
|
| Data Card Author(s) |
| Mary Paz Romero Benavides, Universidade Federal de Viçosa: Owner / Manager |
| Emerson M. Del Ponte, Universidade Federal de Viçosa: Contributor |
| Waldênia de Melo Moura, EPAMIG: Contributor |
|
|
|
|
|
|
|
|
| 🌱 Project Overview |
|
|
| The methodological workflow consists of: |
|
|
| Leaf Detection – YOLOv8 trained using model-assisted annotations |
|
|
| Leaf Extraction – Detection-guided segmentation |
|
|
| Lesion Segmentation – Comparison of five approaches: |
|
|
| ImageJ thresholding |
|
|
| pliman (R package) |
|
|
| DeepLabV3+ |
|
|
| Fine-tuned SAM2 (SAM_CLR) |
| |
| Zero-shot SAM3 |
| |
| Severity Estimation – Pixel-based calculation: |
| |
| S (%) = Diseased Area / Leaf Area × 100 |
| |
| Agreement Analysis – Lin’s Concordance Correlation Coefficient between predicted and reference severity |
| |
| 📊 Dataset Summary |
| |
| The full dataset comprises: |
| |
| 1,285 field-acquired coffee leaf images |
| |
| 606 curated pixel-level rust lesion masks |
| |
| 100 independent evaluation masks |
| |
| Roboflow dataset links: |
| |
| CLR_SAM_dataset: |
| https://universe.roboflow.com/clr-zky50/sam_clr/dataset/1 |
|
|
| DL506: |
| https://universe.roboflow.com/clr-zky50/dl506/dataset/1 |
|
|
| GoldenStandard: |
| https://universe.roboflow.com/clr-zky50/imgtest-fvn9j/dataset/1 |
|
|
| 📂 Repository Structure |
| 📁 01_models |
| |
| Contains documentation describing the trained models used in this study. |
| |
| ⚠️ Due to GitHub file size limitations, model weights are hosted on Hugging Face. |
| |
| Models include: |
| |
| YOLOv8 leaf detector |
| |
| Fine-tuned SAM2 (SAM_CLR) |
|
|
| DeepLabV3+ |
|
|
| Configuration used for zero-shot SAM3 inference |
|
|
| 📁 02_binary_images |
|
|
| Contains validation binary masks (PNG format) corresponding to segmentation outputs from each evaluated method. |
|
|
| These masks were used to compute: |
|
|
| Intersection over Union (IoU) |
|
|
| Dice coefficient |
|
|
| Pixel accuracy |
|
|
| Precision |
|
|
| Recall |
|
|
| Disease severity (%) |
|
|
| Lin’s Concordance Correlation Coefficient (LCCC) |
|
|
| Binary mask format: |
|
|
| 0 → background |
|
|
| 255 → rust lesion |
|
|
| This folder enables independent verification of segmentation performance and severity calculations. |
|
|
| 📁 03_analysis |
| |
| Contains R scripts used to: |
| |
| Compute severity metrics |
| |
| Perform agreement and concordance analysis |
| |
| Generate all figures included in the manuscript |
| |
| Main R dependencies: |
| |
| tidyverse |
| |
| epiR |
| |
| lme4 |
| |
| ggplot2 |
| |
| This folder reproduces the statistical analysis pipeline described in the paper. |
| |
| 🔬 Reproducibility |
| |
| This repository provides: |
| |
| Validation segmentation outputs |
| |
| Statistical analysis scripts |
| |
| Model documentation |
| |
| External links to trained weights |
| |
| Together, these components allow full reproducibility of segmentation metrics and severity agreement results reported in the manuscript. |
| |
| 🤖 Model Hosting |
| |
| All trained model weights are hosted on Hugging Face: |
| |
| 👉 https://huggingface.co/MaryPazRB/Paper_CLR_CV |
| |
| This ensures accessibility without exceeding GitHub file size limitations. |
| |
| 📜 License |
| |
| Code: MIT License |
| |
| Binary masks and annotations: CC-BY 4.0 |
| |
| |
| For questions or collaboration inquiries, please open an issue or contact the corresponding author. |