Update README.md
Browse files
README.md
CHANGED
|
@@ -32,9 +32,11 @@ metrics:
|
|
| 32 |
|
| 33 |
## π Model Summary
|
| 34 |
|
| 35 |
-
|
| 36 |
|
| 37 |
-
|
|
|
|
|
|
|
| 38 |
|
| 39 |
>π‘ Designed for: Precision text classification in sustainable finance, ESG analysis, and corporate governance contexts.
|
| 40 |
|
|
@@ -78,7 +80,7 @@ dataset = datasets.Dataset.from_dict({"text": [
|
|
| 78 |
"""A(An) institutional-type sponsor has filed a shareholder proposal to a(an)
|
| 79 |
energy-sector company. This proposal requests: the company to issue a report
|
| 80 |
on its greenhouse gas emissions reduction targets.
|
| 81 |
-
It falls under a broader agenda class:
|
| 82 |
]})
|
| 83 |
|
| 84 |
# ββ Inference ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
|
|
@@ -92,7 +94,7 @@ for out in tqdm(pipe(KeyDataset(dataset, "text"), padding=True, truncation=True)
|
|
| 92 |
|
| 93 |
### π Recommended Input Format
|
| 94 |
|
| 95 |
-
To
|
| 96 |
```
|
| 97 |
"A(An) {sponsor_type}-type sponsor has filed a shareholder proposal to a(an)
|
| 98 |
{sic2_des}-sector company. This proposal requests: {resolution}.
|
|
@@ -104,8 +106,8 @@ relevant to this specific proposal: {AgendaCodeInformation}"
|
|
| 104 |
|:---|:---|:---|
|
| 105 |
| `{sponsor_type}` | Type of proposal sponsor | `institutional`, `individual` |
|
| 106 |
| `{sic2_des}` | SIC-2 industry sector description | `energy`, `manufacturing` |
|
| 107 |
-
| `{resolution}` | Full text of the proposal resolution | *"
|
| 108 |
-
| `{AgendaCodeInformation}` | ISS agenda code
|
| 109 |
|
| 110 |
> π‘ **Tip:** The `{AgendaCodeInformation}` field is optional but including it generally improves prediction confidence, as it provides additional categorical context into brief resolution context.
|
| 111 |
|
|
|
|
| 32 |
|
| 33 |
## π Model Summary
|
| 34 |
|
| 35 |
+
Shareholder resolutions are often brief and cause ambiguity when considered in isolation. For example, a proposal requesting a report on ``water risk management'' may refer either to environmental water stress (an environmental issue) or to the human right to water access (a social issue).
|
| 36 |
|
| 37 |
+
This model is a fine-tuned version of ClimateBERT and specifically engineered to classify shareholder proposals into green (climate/environmental) or non-green categories.
|
| 38 |
+
|
| 39 |
+
Crucially, it is highly effective at isolating environmental topics from **broad, mixed-ESG contexts** without being distracted by generic sustainability or governance **buzzwords** (etc. Neutrality, Waste, Water...).
|
| 40 |
|
| 41 |
>π‘ Designed for: Precision text classification in sustainable finance, ESG analysis, and corporate governance contexts.
|
| 42 |
|
|
|
|
| 80 |
"""A(An) institutional-type sponsor has filed a shareholder proposal to a(an)
|
| 81 |
energy-sector company. This proposal requests: the company to issue a report
|
| 82 |
on its greenhouse gas emissions reduction targets.
|
| 83 |
+
It falls under a broader agenda class: "..."""
|
| 84 |
]})
|
| 85 |
|
| 86 |
# ββ Inference ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
|
|
|
|
| 94 |
|
| 95 |
### π Recommended Input Format
|
| 96 |
|
| 97 |
+
To address ambiguity in raw proposal text, we can enhance the model's input with structured proposal- and firm-level context, like the training data format:
|
| 98 |
```
|
| 99 |
"A(An) {sponsor_type}-type sponsor has filed a shareholder proposal to a(an)
|
| 100 |
{sic2_des}-sector company. This proposal requests: {resolution}.
|
|
|
|
| 106 |
|:---|:---|:---|
|
| 107 |
| `{sponsor_type}` | Type of proposal sponsor | `institutional`, `individual` |
|
| 108 |
| `{sic2_des}` | SIC-2 industry sector description | `energy`, `manufacturing` |
|
| 109 |
+
| `{resolution}` | Full text of the proposal resolution | *"Report on Climate Change Performance Metrics Into Executive Compensation Program..."* |
|
| 110 |
+
| `{AgendaCodeInformation}` | Description of ISS agenda code | *"This code is used for proposals seeking..."* |
|
| 111 |
|
| 112 |
> π‘ **Tip:** The `{AgendaCodeInformation}` field is optional but including it generally improves prediction confidence, as it provides additional categorical context into brief resolution context.
|
| 113 |
|