MARTINI_enrich_BERTopic_saintsandscholars

This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.

Usage

To use this model, please install BERTopic:

pip install -U bertopic

You can use the model as follows:

from bertopic import BERTopic
topic_model = BERTopic.load("AIDA-UPM/MARTINI_enrich_BERTopic_saintsandscholars")

topic_model.get_topic_info()

Topic overview

  • Number of topics: 34
  • Number of training documents: 3747
Click here for an overview of all topics.
Topic ID Topic Keywords Topic Frequency Label
-1 catholics - saint - ireland - bishop - truth 20 -1_catholics_saint_ireland_bishop
0 abortions - infanticide - personhood - ireland - baby 2294 0_abortions_infanticide_personhood_ireland
1 repentance - psalm - chrysostom - virtues - humble 216 1_repentance_psalm_chrysostom_virtues
2 irish - nationalism - catholicism - multicultural - colonisation 77 2_irish_nationalism_catholicism_multicultural
3 hcg - vaccinated - aborted - injections - infertility 67 3_hcg_vaccinated_aborted_injections
4 lgbtqi - pornography - eireann - curriculum - parents 65 4_lgbtqi_pornography_eireann_curriculum
5 ballyfermot - irish - mcgeough - finucane - sectarian 63 5_ballyfermot_irish_mcgeough_finucane
6 priest - eglise - nantes - riots - stabbed 61 6_priest_eglise_nantes_riots
7 celtic - columba - clonmacnoise - lindisfarne - monasteries 58 7_celtic_columba_clonmacnoise_lindisfarne
8 sacredheartpilgrimage - rosary - laois - damien - november 57 8_sacredheartpilgrimage_rosary_laois_damien
9 pope - benedict - sedevacantists - cardinalate - santamaria 49 9_pope_benedict_sedevacantists_cardinalate
10 rosary - prayers - mondays - benediction - papalencyclicals 43 10_rosary_prayers_mondays_benediction
11 footballers - hypocrites - israel - munster - gay 42 11_footballers_hypocrites_israel_munster
12 rosary - blessed - communion - mary - annunciation 42 12_rosary_blessed_communion_mary
13 pelosi - abortion - vatican - veto - defunded 40 13_pelosi_abortion_vatican_veto
14 martyrs - legacy_irish - canonized - rathlin - plunkett 40 14_martyrs_legacy_irish_canonized_rathlin
15 immigrants - aquinas - commandment - patriotism - lawgivers 38 15_immigrants_aquinas_commandment_patriotism
16 ukraine - conspiracy - traitors - democracies - sources 38 16_ukraine_conspiracy_traitors_democracies
17 palestinians - zionist - jerusalem - evangelicals - islamist 37 17_palestinians_zionist_jerusalem_evangelicals
18 draghi - monsignor - vigano - covid - protests 37 18_draghi_monsignor_vigano_covid
19 pontiff - antichrist - fatima - consecration - miraculous 36 19_pontiff_antichrist_fatima_consecration
20 tyranny - incitement - offended - hatred - legislation 35 20_tyranny_incitement_offended_hatred
21 gospels - apostolic - epistle - doctrines - monophysitism 28 21_gospels_apostolic_epistle_doctrines
22 chesterton - virtues - foolish - chastises - renowned 27 22_chesterton_virtues_foolish_chastises
23 population - marriages - wales - 2022 - declining 25 23_population_marriages_wales_2022
24 kilmainham - padraig - friar - executions - requiem 25 24_kilmainham_padraig_friar_executions
25 atheism - existence - nominalism - sartre - infinity 25 25_atheism_existence_nominalism_sartre
26 irishmirror - mullingar - migrant - protests - baldoyle 25 26_irishmirror_mullingar_migrant_protests
27 migrants - deportations - homeless - irish - millions 24 27_migrants_deportations_homeless_irish
28 catechisms - papalencyclicals - sean - infallible - resource 24 28_catechisms_papalencyclicals_sean_infallible
29 prayer - archangel - satan - michael - immaculate 23 29_prayer_archangel_satan_michael
30 aoibhneas - mbeannaithear - uilechumhachtach - throcaireach - aonmhac 22 30_aoibhneas_mbeannaithear_uilechumhachtach_throcaireach
31 saints - bosco - podcast - subscribed - friendsofaquinas 22 31_saints_bosco_podcast_subscribed
32 pope - bishops - heretics - hypocrite - sspx 22 32_pope_bishops_heretics_hypocrite

Training hyperparameters

  • calculate_probabilities: True
  • language: None
  • low_memory: False
  • min_topic_size: 10
  • n_gram_range: (1, 1)
  • nr_topics: None
  • seed_topic_list: None
  • top_n_words: 10
  • verbose: False
  • zeroshot_min_similarity: 0.7
  • zeroshot_topic_list: None

Framework versions

  • Numpy: 1.26.4
  • HDBSCAN: 0.8.40
  • UMAP: 0.5.7
  • Pandas: 2.2.3
  • Scikit-Learn: 1.5.2
  • Sentence-transformers: 3.3.1
  • Transformers: 4.46.3
  • Numba: 0.60.0
  • Plotly: 5.24.1
  • Python: 3.10.12
Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support