felipesuarezgiri commited on
Commit
b0a7b79
·
verified ·
1 Parent(s): 9a3b6c7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -6
README.md CHANGED
@@ -2,16 +2,19 @@
2
  license: apache-2.0
3
  ---
4
  Biodiversity Commitment Classifier
5
- Model Overview
6
  This binary text classification model identifies and distinguishes biodiversity commitments in corporate sustainability reports. It classifies paragraphs as either:
7
 
8
  Commitment: The company pledges specific actions to improve biodiversity or reduce negative environmental impacts
9
  Non-commitment: General statements, observations, or non-actionable content related to biodiversity
10
 
11
- Model Architecture
12
  Built on ClimateBERT, a DistilRoBERTa-based model pre-trained on climate-related text, this classifier was fine-tuned specifically for biodiversity commitment detection in corporate disclosures.
13
- Training Data
14
  The model was trained on a curated dataset of 2,000 manually annotated paragraphs extracted from sustainability reports of Fortune Global 500 companies, ensuring high-quality labels and real-world applicability to corporate ESG analysis.
15
- Performance Metrics
16
- Evaluated using 5-fold cross-validation:
17
- MetricScoreWeighted F10.928 Weighted Precision0.930Weighted Recall0.929AUC-ROC0.976
 
 
 
 
2
  license: apache-2.0
3
  ---
4
  Biodiversity Commitment Classifier
5
+ Model Overview:
6
  This binary text classification model identifies and distinguishes biodiversity commitments in corporate sustainability reports. It classifies paragraphs as either:
7
 
8
  Commitment: The company pledges specific actions to improve biodiversity or reduce negative environmental impacts
9
  Non-commitment: General statements, observations, or non-actionable content related to biodiversity
10
 
11
+ Model Architecture:
12
  Built on ClimateBERT, a DistilRoBERTa-based model pre-trained on climate-related text, this classifier was fine-tuned specifically for biodiversity commitment detection in corporate disclosures.
13
+ Training Data:
14
  The model was trained on a curated dataset of 2,000 manually annotated paragraphs extracted from sustainability reports of Fortune Global 500 companies, ensuring high-quality labels and real-world applicability to corporate ESG analysis.
15
+
16
+ Recommended Pipeline: First use ESGBERT/EnvironmentalBERT-biodiversity to identify biodiversity-related paragraphs, then apply this model to identify commitments.
17
+
18
+ Performance Metrics:
19
+ Average of 5-fold cross-validation:
20
+ Weighted F10.928 Weighted Precision0.930Weighted Recall0.929AUC-ROC0.976