| <!DOCTYPE html> |
| <html> |
| <head> |
| <title>Bootstrap Online Editor</title> |
| <meta name="viewport" content="width=device-width, initial-scale=1"> |
| <link rel="stylesheet" href="https://maxcdn.bootstrapcdn.com/bootstrap/4.0.0/css/bootstrap.min.css"> |
| <script src="https://ajax.googleapis.com/ajax/libs/jquery/3.6.0/jquery.min.js"></script> |
| <script src="https://maxcdn.bootstrapcdn.com/bootstrap/4.0.0/js/bootstrap.min.js"></script> |
| </head> |
| <body> |
|
|
| <div class="container"> |
| <hr> |
| <h2 style="text-align: center;">NLPre-PL Dataset</h2> |
| <hr> |
| <p>The official NLPre-PL dataset - a uniformly paragraph-level divided version of NKJP1M corpus - the 1 million token balanced subcorpus of the National Corpus of Polish (Narodowy Korpus Jezyka Polskiego). |
| </p> |
| <p> |
| The NLPre dataset aims at fairly dividing the paragraphs length-wise and topic-wise into train, development, and test sets. Thus, we ensure a similar number of segments distribution per paragraph and avoid the situation when paragraphs with a small (or large) number of segments are available only e.g. during test time. |
| </p> |
| <p> |
| <a style="text-align: center;" href="https://huggingface.co/datasets/ipipan/nlprepl" target="_blank" class="btn btn-primary btn-lg active" role="button" aria-pressed="true">🤗 NLPre-PL Dataset</a> |
| |
| <a style="text-align: center;"href="http://git.nlp.ipipan.waw.pl/alina/PDBUD" target="_blank" class="btn btn-primary btn-lg active" role="button" aria-pressed="true">🤗 PDB-UD Dataset</a> |
| </p> |
| |
| <div><p></p></div> |
|
|
| <div class="container"> |
| <hr> |
| <h2 style="text-align: center;">NLPre-PL Trained models</h2> |
| <hr> |
| <p>Here are listed all available models, trained for the purpouse of creating NLPre-PL Benchmark.</p> |
|
|
| <div class="accordion" id="accordionExample"> |
| <div class="card"> |
| <div class="card-header" id="headingOne"> |
| <h5 class="mb-0"> |
| <button class="btn btn-link" type="button" data-toggle="collapse" data-target="#collapseOne" aria-expanded="false" aria-controls="collapseOne"> |
| 🤗 COMBO |
| </button> |
| </h5> |
| </div> |
|
|
| <div id="collapseOne" class="collapse show" aria-labelledby="headingOne" data-parent="#accordionExample"> |
| <hr> |
| <h5 style="text-align: center;">UD TAGSET</h5> |
| <hr> |
| |
| <ul class="list-group list-group-light list-group-small"> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_herBERT_pdb" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + HerBERT + PDB-UD</a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_herBERT_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + HerBERT + NLPrePL-fair-by-name </a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_herBERT_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + HerBERT + NLPrePL-fair-by-type </a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_fasttext_pdb" target="_blank" class="btn btn-seconday btn-lg active" >COMBO + fasttext + PDB-UD </a></li> |
|
|
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_fasttext_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + fasttext + NLPrePL-fair-by-name </a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_fasttext_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + fasttext + NLPrePL-fair-by-type </a></li> |
| </ul> |
| |
| |
| <hr> |
| <h5 style="text-align: center;">NKJP TAGSET</h4> |
| <hr> |
| |
| <ul class="list-group list-group-light list-group-small"> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_herBERT_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + HerBERT + NLPrePL-fair-by-name</a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_herBERT_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" >COMBO + HerBERT + NLPrePL-fair-by-type </a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_fasttext_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + fasttext + NLPrePL-fair-by-name</a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_fasttext_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" >COMBO + fasttext + NLPrePL-fair-by-type </a></li> |
| </ul> |
| |
| </div> |
| </div> |
| <div class="card"> |
| <div class="card-header" id="headingTwo"> |
| <h5 class="mb-0"> |
| <button class="btn btn-link collapsed" type="button" data-toggle="collapse" data-target="#collapseTwo" aria-expanded="false" aria-controls="collapseTwo"> |
| 🤗 SPACY |
| </button> |
| </h5> |
| </div> |
| <div id="collapseTwo" class="collapse" aria-labelledby="headingTwo" data-parent="#accordionExample"> |
| <div class="card-body"> |
| <hr> |
| <h5 style="text-align: center;">UD TAGSET</h5> |
| <hr> |
| |
| <ul class="list-group list-group-light list-group-small"> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_ud_fasttext_pdb" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + fasttext + PDB-UD</a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_ud_fasttext_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" >spaCy + fasttext + NLPrePL-fair-by-name </a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_ud_fasttext_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + fasttext + NLPrePL-fair-by-name </a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_ud_transformer_pdb" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + transformer + PDB-UD</a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_ud_transformer_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + transformer + NLPrePL-fair-by-name </a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_ud_transformer_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + transformer + NLPrePL-fair-by-type </a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_ud_pl-core-news-lg_pdb" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + pl-core-news-lg + PDB-UD </a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_ud_pl-core-news-lg_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + pl-core-news-lg + NLPrePL-fair-by-name </a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_ud_pl-core-news-lg_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + pl-core-news-lg + NLPrePL-fair-by-type </a></li> |
| </ul> |
| |
| <hr> |
| <h5 style="text-align: center;">NKJP TAGSET</h5> |
| <hr> |
| |
| <ul class="list-group list-group-light list-group-small"> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_nkjp_fasttext_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + fasttext + NLPrePL-fair-by-name</a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_nkjp_fasttext_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + fasttext + NLPrePL-fair-by-type </a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_nkjp_transformer_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + transformer + NLPrePL-fair-by-name</a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_nkjp_transformer_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + transformer + NLPrePL-fair-by-type </a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_nkjp_pl-core-news-lg_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + pl-core-news-lg + NLPrePL-fair-by-name</a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_nkjp_pl-core-news-lg_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + pl-core-news-lg + NLPrePL-fair-by-type </a></li> |
| </ul> |
| </div> |
| </div> |
| </div> |
| <div class="card"> |
| <div class="card-header" id="headingThree"> |
| <h5 class="mb-0"> |
| <button class="btn btn-link collapsed" type="button" data-toggle="collapse" data-target="#collapseThree" aria-expanded="false" aria-controls="collapseThree"> |
| 🤗 STANZA |
| </button> |
| </h5> |
| </div> |
| <div id="collapseThree" class="collapse" aria-labelledby="headingThree" data-parent="#accordionExample"> |
| <div class="card-body"> |
| <hr> |
| <h5 style="text-align: center;">UD TAGSET</h5> |
| <hr> |
| |
| <ul class="list-group list-group-light list-group-small"> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_stanza_ud_fasttext_pdb" target="_blank" class="btn btn-seconday btn-lg active" > Stanza + fasttext + PDB-UD</a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_stanza_ud_fasttext_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" >Stanza + fasttext + NLPrePL-fair-by-name </a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_stanza_ud_fasttext_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > Stanza + fasttext + NLPrePL-fair-by-type </a></li> |
|
|
| </ul> |
| |
| |
| <hr> |
| <h5 style="text-align: center;">NKJP TAGSET</h5> |
| <hr> |
| |
| <ul class="list-group list-group-light list-group-small"> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_stanza_nkjp_fasttext_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > Stanza + fasttext + NLPrePL-fair-by-name</a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_stanza_nkjp_fasttext_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > Stanza + fasttext + NLPrePL-fair-by-type </a></li> |
|
|
| </ul> |
| </div> |
| </div> |
| </div> |
| <div class="card"> |
| <div class="card-header" id="headingFour"> |
| <h5 class="mb-0"> |
| <button class="btn btn-link collapsed" type="button" data-toggle="collapse" data-target="#collapseFour" aria-expanded="false" aria-controls="collapseThree"> |
| 🤗 TRANKIT |
| </button> |
| </h5> |
| </div> |
| <div id="collapseFour" class="collapse" aria-labelledby="headingFour" data-parent="#accordionExample"> |
| <div class="card-body"> |
| <hr> |
| <h5 style="text-align: center;">UD TAGSET</h5> |
| <hr> |
| |
| <ul class="list-group list-group-light list-group-small"> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_trankit_ud_xlm-roberta-large_pdb" target="_blank" class="btn btn-seconday btn-lg active" > Trankit + xlm-RoBERTa-Large + PDB-UD</a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_trankit_ud_xlm-roberta-large_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" >Trankit + xlm-RoBERTa-Large + NLPrePL-fair-by-name</a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_trankit_ud_xlm-roberta-large_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > Trankit + xlm-RoBERTa-Large + NLPrePL-fair-by-type </a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_trankit_ud_xlm-roberta-base_pdb" target="_blank" class="btn btn-seconday btn-lg active" > Trankit + xlm-RoBERTa-Base + PDB-UD </a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_trankit_ud_xlm-roberta-base_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > Trankit + xlm-RoBERTa-Base + NLPrePL-fair-by-name </a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_trankit_ud_xlm-roberta-base_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > Trankit + xlm-RoBERTa-Base + NLPrePL-fair-by-type </a></li> |
| </ul> |
| |
| <hr> |
| <h5 style="text-align: center;">NKJP TAGSET</h5> |
| <hr> |
| <ul class="list-group list-group-light list-group-small"> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_trankit_nkjp_xlm-roberta-large_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > Trankit + xlm-RoBERTa-Large + NLPrePL-fair-by-name</a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_trankit_nkjp_xlm-roberta-large_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" >Trankit + xlm-RoBERTa-Large + NLPrePL-fair-by-type </a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_trankit_nkjp_xlm-roberta-base_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > Trankit + xlm-RoBERTa-Base + NLPrePL-fair-by-name</a></li> |
| <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_trankit_nkjp_xlm-roberta-base_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" >Trankit + xlm-RoBERTa-Base + NLPrePL-fair-by-type </a></li> |
| </ul> |
| </div> |
| </div> |
| </div> |
| </div> |
|
|
| </div> |
|
|
| </body> |
| </html> |