---
language: en
license: apache-2.0
---

# Model Card for distilroberta-base-climate-s

## Model Description

This is the ClimateBERT language model based on the SIM-SELECT sample selection strategy.

*Note: We generally recommend choosing the [distilroberta-base-climate-f](https://huggingface.co/climatebert/distilroberta-base-climate-f) language model over this language model (unless you have good reasons not to).*

Using the [DistilRoBERTa](https://huggingface.co/distilroberta-base) model as a starting point, the ClimateBERT language model is additionally pretrained on a text corpus comprising climate-related research paper abstracts, corporate and general news, and company reports. The underlying methodology can be found in our [language model research paper](https://arxiv.org/abs/2110.12010).
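
As a masked language model, it can be used directly for fill-mask prediction or fine-tuned on downstream climate-related tasks. A minimal sketch with the `transformers` library, assuming the checkpoint is published as `climatebert/distilroberta-base-climate-s` on the Hugging Face Hub:

```python
from transformers import pipeline

# The model is a DistilRoBERTa-style masked language model,
# so it plugs into the standard fill-mask pipeline.
fill_mask = pipeline("fill-mask", model="climatebert/distilroberta-base-climate-s")

# RoBERTa tokenizers use <mask> as the mask token.
predictions = fill_mask(
    "Emissions trading is a market-based approach to reducing <mask> emissions."
)
for p in predictions:
    print(p["token_str"], round(p["score"], 3))
```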

## Climate performance card

| distilroberta-base-climate-s                                              |                |
|---------------------------------------------------------------------------|----------------|
| 1. Is the resulting model publicly available?                             | Yes            |
| 2. How much time does the training of the final model take?               | 8 hours        |
| 3. How much time did all experiments take (incl. hyperparameter search)?  | 288 hours      |
| 4. What was the average power consumption (GPU/CPU)?                      | 0.7 kW         |
| 5. At which geo location were the computations performed?                 | Germany        |
| 6. What was the energy mix at the geo location?                           | 470 gCO2eq/kWh |
| 7. How much CO2eq was emitted to train the final model?                   | 2.63 kg        |
| 8. How much CO2eq was emitted for all experiments?                        | 94.75 kg       |
| 9. What is the average CO2eq emission for the inference of one sample?    | 0.62 mg        |
| 10. Which positive environmental impact can be expected from this work?   | This work can be categorized as a building block tool following Jin et al. (2021). It supports the training of NLP models in the field of climate change and can thereby have a positive environmental impact in the future. |
| 11. Comments                                                              | Block pruning could decrease CO2eq emissions. |
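
The energy and emission figures above are internally consistent: emissions equal runtime times average power draw times the grid's carbon intensity. A quick check of rows 7 and 8 against rows 2-4 and 6:

```python
# Recompute rows 7-8 of the performance card from rows 2-4 and 6.
power_kw = 0.7           # average power consumption (row 4)
grid_gco2_per_kwh = 470  # energy mix at the compute location (row 6)

for label, hours in [("final model", 8), ("all experiments", 288)]:
    energy_kwh = hours * power_kw
    co2_kg = energy_kwh * grid_gco2_per_kwh / 1000
    print(f"{label}: {energy_kwh:.1f} kWh -> {co2_kg:.2f} kg CO2eq")

# final model: 5.6 kWh -> 2.63 kg CO2eq
# all experiments: 201.6 kWh -> 94.75 kg CO2eq
```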

### Citation Information

```bibtex
@article{wkbl2021,
  title={ClimateBERT: A Pretrained Language Model for Climate-Related Text},
|