Update app.py
Browse files
app.py
CHANGED
|
@@ -78,10 +78,15 @@ article = """
|
|
| 78 |
</ul>
|
| 79 |
|
| 80 |
<h3>Conclusion and Future Work</h3>
|
| 81 |
-
|
|
|
|
|
|
|
|
|
|
| 82 |
However, the F1 Scores reported for the Biomedical roberta-based models are not far below from those of the general roberta-based model.
|
| 83 |
|
| 84 |
If only unanswerable questions are taken into account, the model with the best F1 Score is <a href="https://huggingface.co/hackathon-pln-es/roberta-base-biomedical-es-squad2-es">hackathon-pln-es/roberta-base-biomedical-es-squad2-es</a>.
|
|
|
|
|
|
|
| 85 |
The model <a href="https://huggingface.co/hackathon-pln-es/biomedtra-small-es-squad2-es">hackathon-pln-es/biomedtra-small-es-squad2-es</a>, on the contrary, shows inability to correctly identify unanswerable questions.
|
| 86 |
|
| 87 |
As future work, the following experiments could be carried out:
|
|
|
|
| 78 |
</ul>
|
| 79 |
|
| 80 |
<h3>Conclusion and Future Work</h3>
|
| 81 |
+
|
| 82 |
+
If F1 Score is considered, the results show that there may be no advantage in using domain-specific masked language models to generate Biomedical QA models.
|
| 83 |
+
|
| 84 |
+
|
| 85 |
However, the F1 Scores reported for the Biomedical roberta-based models are not far below from those of the general roberta-based model.
|
| 86 |
|
| 87 |
If only unanswerable questions are taken into account, the model with the best F1 Score is <a href="https://huggingface.co/hackathon-pln-es/roberta-base-biomedical-es-squad2-es">hackathon-pln-es/roberta-base-biomedical-es-squad2-es</a>.
|
| 88 |
+
|
| 89 |
+
|
| 90 |
The model <a href="https://huggingface.co/hackathon-pln-es/biomedtra-small-es-squad2-es">hackathon-pln-es/biomedtra-small-es-squad2-es</a>, on the contrary, shows inability to correctly identify unanswerable questions.
|
| 91 |
|
| 92 |
As future work, the following experiments could be carried out:
|