Jidi1997 commited on
Commit
54140cb
Β·
verified Β·
1 Parent(s): 749e9f8

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -6
README.md CHANGED
@@ -32,9 +32,11 @@ metrics:
32
 
33
  ## πŸ“‹ Model Summary
34
 
35
- This model is a fine-tuned version of climatebert/distilroberta-base-climate-detector. It is specifically engineered to classify shareholder proposals into green (climate/environmental) or non-green categories.
36
 
37
- Crucially, it is highly effective at isolating environmental topics from **broad, mixed-ESG contexts** without being distracted by generic sustainability or governance buzzwords (etc. Neutrality, Waste, Water...). Trained on a meticulously curated dataset, it achieves an F1 score of 0.981 on the validation set.
 
 
38
 
39
  >πŸ’‘ Designed for: Precision text classification in sustainable finance, ESG analysis, and corporate governance contexts.
40
 
@@ -78,7 +80,7 @@ dataset = datasets.Dataset.from_dict({"text": [
78
  """A(An) institutional-type sponsor has filed a shareholder proposal to a(an)
79
  energy-sector company. This proposal requests: the company to issue a report
80
  on its greenhouse gas emissions reduction targets.
81
- It falls under a broader agenda class: Environmental/Social."""
82
  ]})
83
 
84
  # ── Inference ──────────────────────────────────────────────────────────────────
@@ -92,7 +94,7 @@ for out in tqdm(pipe(KeyDataset(dataset, "text"), padding=True, truncation=True)
92
 
93
  ### πŸ“Œ Recommended Input Format
94
 
95
- To achieve optimal performance, structure your input text to mirror the training data format:
96
  ```
97
  "A(An) {sponsor_type}-type sponsor has filed a shareholder proposal to a(an)
98
  {sic2_des}-sector company. This proposal requests: {resolution}.
@@ -104,8 +106,8 @@ relevant to this specific proposal: {AgendaCodeInformation}"
104
  |:---|:---|:---|
105
  | `{sponsor_type}` | Type of proposal sponsor | `institutional`, `individual` |
106
  | `{sic2_des}` | SIC-2 industry sector description | `energy`, `manufacturing` |
107
- | `{resolution}` | Full text of the proposal resolution | *"the company to report on..."* |
108
- | `{AgendaCodeInformation}` | ISS agenda code label *(optional but recommended)* | `Environmental/Social` |
109
 
110
  > πŸ’‘ **Tip:** The `{AgendaCodeInformation}` field is optional but including it generally improves prediction confidence, as it provides additional categorical context into brief resolution context.
111
 
 
32
 
33
  ## πŸ“‹ Model Summary
34
 
35
+ Shareholder resolutions are often brief and cause ambiguity when considered in isolation. For example, a proposal requesting a report on ``water risk management'' may refer either to environmental water stress (an environmental issue) or to the human right to water access (a social issue).
36
 
37
+ This model is a fine-tuned version of ClimateBERT and specifically engineered to classify shareholder proposals into green (climate/environmental) or non-green categories.
38
+
39
+ Crucially, it is highly effective at isolating environmental topics from **broad, mixed-ESG contexts** without being distracted by generic sustainability or governance **buzzwords** (etc. Neutrality, Waste, Water...).
40
 
41
  >πŸ’‘ Designed for: Precision text classification in sustainable finance, ESG analysis, and corporate governance contexts.
42
 
 
80
  """A(An) institutional-type sponsor has filed a shareholder proposal to a(an)
81
  energy-sector company. This proposal requests: the company to issue a report
82
  on its greenhouse gas emissions reduction targets.
83
+ It falls under a broader agenda class: "..."""
84
  ]})
85
 
86
  # ── Inference ──────────────────────────────────────────────────────────────────
 
94
 
95
  ### πŸ“Œ Recommended Input Format
96
 
97
+ To address ambiguity in raw proposal text, we can enhance the model's input with structured proposal- and firm-level context, like the training data format:
98
  ```
99
  "A(An) {sponsor_type}-type sponsor has filed a shareholder proposal to a(an)
100
  {sic2_des}-sector company. This proposal requests: {resolution}.
 
106
  |:---|:---|:---|
107
  | `{sponsor_type}` | Type of proposal sponsor | `institutional`, `individual` |
108
  | `{sic2_des}` | SIC-2 industry sector description | `energy`, `manufacturing` |
109
+ | `{resolution}` | Full text of the proposal resolution | *"Report on Climate Change Performance Metrics Into Executive Compensation Program..."* |
110
+ | `{AgendaCodeInformation}` | Description of ISS agenda code | *"This code is used for proposals seeking..."* |
111
 
112
  > πŸ’‘ **Tip:** The `{AgendaCodeInformation}` field is optional but including it generally improves prediction confidence, as it provides additional categorical context into brief resolution context.
113