Update README.md
Browse files
README.md
CHANGED
|
@@ -7,10 +7,10 @@ tags:
|
|
| 7 |
- climate
|
| 8 |
---
|
| 9 |
|
| 10 |
-
# CatastroBERT a model for Extreme weather events detection in
|
| 11 |
|
| 12 |
This model aims to facilitate the detection of paragraphs or articles relevant to extreme weather events
|
| 13 |
-
in
|
| 14 |
|
| 15 |
<div align=center>
|
| 16 |
<img src="images/bert_illustration.png" width="500" height="500" />
|
|
@@ -22,7 +22,7 @@ in french text. It is based on the [camembert-base](https://huggingface.co/camem
|
|
| 22 |
- **Language(s) (NLP):** French
|
| 23 |
- **Finetuned from model :** [camembert-base](https://huggingface.co/camembert-base) (RoBERTa Checkpoint)
|
| 24 |
|
| 25 |
-
- **Repository:** Check the [CatastroBERT](https://github.com/dh-epfl-students/dhlab-CatastroBERT)
|
| 26 |
|
| 27 |
## Usage
|
| 28 |
|
|
@@ -71,7 +71,7 @@ print(f"Prediction: {predict(text)}")
|
|
| 71 |
|
| 72 |
### Training Data
|
| 73 |
|
| 74 |
-
This model was trained on manually a manually
|
| 75 |
|
| 76 |
## Environmental Impact
|
| 77 |
|
|
|
|
| 7 |
- climate
|
| 8 |
---
|
| 9 |
|
| 10 |
+
# CatastroBERT a model for Extreme weather events detection in French text
|
| 11 |
|
| 12 |
This model aims to facilitate the detection of paragraphs or articles relevant to extreme weather events
|
| 13 |
+
in French text. It is based on the [camembert-base](https://huggingface.co/camembert-base) model and was trained on manually annotated data (articles summaries) from the Gazette de Lausanne archives collected by [impresso](https://impresso-project.ch/)
|
| 14 |
|
| 15 |
<div align=center>
|
| 16 |
<img src="images/bert_illustration.png" width="500" height="500" />
|
|
|
|
| 22 |
- **Language(s) (NLP):** French
|
| 23 |
- **Finetuned from model :** [camembert-base](https://huggingface.co/camembert-base) (RoBERTa Checkpoint)
|
| 24 |
|
| 25 |
+
- **Repository:** Check the [CatastroBERT](https://github.com/dh-epfl-students/dhlab-CatastroBERT) GitHub page for more usage examples and information.
|
| 26 |
|
| 27 |
## Usage
|
| 28 |
|
|
|
|
| 71 |
|
| 72 |
### Training Data
|
| 73 |
|
| 74 |
+
This model was trained on manually a manually annotated dataset (articles summaries) curated from the Gazette de Lausanne archives collected by the [impresso](https://impresso-project.ch/) project. The dataset is composed of 4500 articles summaries of which 3500 were used for training and 1000 for validation.
|
| 75 |
|
| 76 |
## Environmental Impact
|
| 77 |
|