wallacelw commited on
Commit
68c287a
·
verified ·
1 Parent(s): 7183b05

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +30 -0
README.md CHANGED
@@ -1,5 +1,19 @@
1
  ---
2
  license: apache-2.0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3
  ---
4
 
5
  This model is a derivation from ModernBERT specialized in the Brazilian Portuguese language, **pre-trained** from data in this language scope.
@@ -66,3 +80,19 @@ This work was supported in part by Advanced Micro Devices, Inc. under the AMD AI
66
  HPC Cluster Program. Furthermore, the respective authors are appreciated for providing
67
  the Wikipedia, BrWac, ASSIN2, and LeNER-BR datasets
68
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
+ datasets:
4
+ - UFRGS/brwac
5
+ - wikimedia/wikipedia
6
+ - peluz/lener_br
7
+ - nilc-nlp/assin2
8
+ language:
9
+ - pt
10
+ metrics:
11
+ - f1
12
+ - accuracy
13
+ - precision
14
+ base_model:
15
+ - answerdotai/ModernBERT-base
16
+ pipeline_tag: fill-mask
17
  ---
18
 
19
  This model is a derivation from ModernBERT specialized in the Brazilian Portuguese language, **pre-trained** from data in this language scope.
 
80
  HPC Cluster Program. Furthermore, the respective authors are appreciated for providing
81
  the Wikipedia, BrWac, ASSIN2, and LeNER-BR datasets
82
 
83
+ ## Citation
84
+ If you use our work, please cite:
85
+
86
+ ```
87
+ @inproceedings{wu2025modbertbr,
88
+ author = {Wu, Wallace Ben Teng Lin and Garcia, Luis Paulo Faina},
89
+ title = {ModBERTBr: A ModernBERT-based Model for Brazilian Portuguese},
90
+ booktitle = {Anais do 22º Encontro Nacional de Inteligência Artificial e Computacional (ENIAC)},
91
+ year = {2025},
92
+ address = {Fortaleza, CE, Brasil},
93
+ pages = {2044--2055},
94
+ publisher = {Sociedade Brasileira de Computação},
95
+ issn = {2763-9061},
96
+ doi = {10.5753/eniac.2025.14516},
97
+ }
98
+ ```