Master-thesis-NAP
/

ModernBert-DAPT-math

Model card Files Files and versions

RosaMelo commited on May 19, 2025

Commit

a30384f

·

verified ·

1 Parent(s): 58f8657

Update README.md

Files changed (1) hide show

README.md +8 -3

README.md CHANGED Viewed

@@ -7,7 +7,8 @@ tags: []
-weaufia' faopwf oain lk<!-- Provide a quick summary of what the model is/does. -->
@@ -15,9 +16,13 @@ weaufia' faopwf oain lk<!-- Provide a quick summary of what the model is/does. -
 ### Model Description
-qrqrq3rq3  q<qr qrq!-- Provide a longer summary of what this model is. -->
-This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
 - **Developed by:** [More Information Needed]
 - **Funded by [optional]:** [More Information Needed]

+This is a ModernBERT that it has been trained with Latex Files, of mathematical papers.  To improve the ability of reading latex files especialy the mathematical equations and parts.
+I
 ### Model Description
+It's been trained with 12099 mathematical papers. Where we did some preprocessing to eliminate the non-content-meaningfull parts of the papers.
+And we removed the latex parts that bring no information: as \\(?:begin|end)\{[^}]+\}" ,\item ,\\(?:noindent|medskip|smallskip|bigskip|newpage|clearpage)
+And more.
+The dataset is obtained by scrapping mathematical papers.
+One day I will finish this README, if you have a question, feel free to send me a mail: garciacomapol@gmail.com
 - **Developed by:** [More Information Needed]
 - **Funded by [optional]:** [More Information Needed]