| |
|
| | --- |
| | tags: |
| | - bertopic |
| | library_name: bertopic |
| | pipeline_tag: text-classification |
| | --- |
| | |
| | # MARTINI_enrich_BERTopic_Fall_of_the_Cabal |
| |
|
| | This is a [BERTopic](https://github.com/MaartenGr/BERTopic) model. |
| | BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets. |
| |
|
| | ## Usage |
| |
|
| | To use this model, please install BERTopic: |
| |
|
| | ``` |
| | pip install -U bertopic |
| | ``` |
| |
|
| | You can use the model as follows: |
| |
|
| | ```python |
| | from bertopic import BERTopic |
| | topic_model = BERTopic.load("AIDA-UPM/MARTINI_enrich_BERTopic_Fall_of_the_Cabal") |
| | |
| | topic_model.get_topic_info() |
| | ``` |
| |
|
| | ## Topic overview |
| |
|
| | * Number of topics: 37 |
| | * Number of training documents: 5257 |
| |
|
| | <details> |
| | <summary>Click here for an overview of all topics.</summary> |
| | |
| | | Topic ID | Topic Keywords | Topic Frequency | Label | |
| | |----------|----------------|-----------------|-------| |
| | | -1 | vaccinated - covid - trump - niet - public | 20 | -1_vaccinated_covid_trump_niet | |
| | | 0 | gevaccineerden - ziekenhuis - niet - corona - maatregelen | 3055 | 0_gevaccineerden_ziekenhuis_niet_corona | |
| | | 1 | ukraine - putin - lukashenko - nordstream - sanctions | 198 | 1_ukraine_putin_lukashenko_nordstream | |
| | | 2 | veel - stembiljetten - gastvrijheid - nieuwsbrief - belangenverstrengeling | 155 | 2_veel_stembiljetten_gastvrijheid_nieuwsbrief | |
| | | 3 | glutathione - omicron - ivermectin - detoxify - hydroxychloroquine | 150 | 3_glutathione_omicron_ivermectin_detoxify | |
| | | 4 | vaccinated - pfizer - vaers - fatalities - 2021 | 128 | 4_vaccinated_pfizer_vaers_fatalities | |
| | | 5 | australiatimes - queensland - protesters - blockade - aborigines | 109 | 5_australiatimes_queensland_protesters_blockade | |
| | | 6 | boerburgerbeweging - netherlands - monsanto - farmer - bbb | 87 | 6_boerburgerbeweging_netherlands_monsanto_farmer | |
| | | 7 | amersfoort - raadhuisplein - rotterdam - woensdag - niet | 86 | 7_amersfoort_raadhuisplein_rotterdam_woensdag | |
| | | 8 | kindermisbruik - pedophilia - presentatieserie - indoctrinatie - sexualization | 84 | 8_kindermisbruik_pedophilia_presentatieserie_indoctrinatie | |
| | | 9 | co2 - climate - davos - twitterigtruth - 2030 | 82 | 9_co2_climate_davos_twitterigtruth | |
| | | 10 | trafficked - pedophilia - europol - philippines - helpline | 82 | 10_trafficked_pedophilia_europol_philippines | |
| | | 11 | fallcabal - rabobank - betalen - donate - kanaal | 69 | 11_fallcabal_rabobank_betalen_donate | |
| | | 12 | donald - ballots - newsmax - recount - maricopa | 65 | 12_donald_ballots_newsmax_recount | |
| | | 13 | sharetrump - prasident - censorship - announcement - omgosh | 64 | 13_sharetrump_prasident_censorship_announcement | |
| | | 14 | france - unvaccinated - lockdown - protests - liberte | 61 | 14_france_unvaccinated_lockdown_protests | |
| | | 15 | presentatie - live - cabal - ufo - katholieke | 60 | 15_presentatie_live_cabal_ufo | |
| | | 16 | trudeau - truckersforfreedom - winnipeg - confiscating - scotia | 59 | 16_trudeau_truckersforfreedom_winnipeg_confiscating | |
| | | 17 | zoompresentatie - zondag - overzicht - vrienden - reserveren | 53 | 17_zoompresentatie_zondag_overzicht_vrienden | |
| | | 18 | niet - joost - voorarrest - bodengraven - rechtbank | 53 | 18_niet_joost_voorarrest_bodengraven | |
| | | 19 | fauci - vaccine - remdesivir - bioterrorist - marburg | 49 | 19_fauci_vaccine_remdesivir_bioterrorist | |
| | | 20 | brainwashed - humanity - worthiness - ignorance - suffering | 44 | 20_brainwashed_humanity_worthiness_ignorance | |
| | | 21 | vaccinated - quackcine - tetanus - unenlightened - allergic | 40 | 21_vaccinated_quackcine_tetanus_unenlightened | |
| | | 22 | conspiracies - channel - believes - newspapers - biased | 39 | 22_conspiracies_channel_believes_newspapers | |
| | | 23 | netanyahu - israelische - vaccinated - התפרסם - הילדים | 36 | 23_netanyahu_israelische_vaccinated_התפרסם | |
| | | 24 | china - quarantine - shenzhen - riots - urumqi | 33 | 24_china_quarantine_shenzhen_riots | |
| | | 25 | cashless - worldwidedemonstration - withdrawing - monetary - worldcoin | 32 | 25_cashless_worldwidedemonstration_withdrawing_monetary | |
| | | 26 | wereldprotesten - switzerland - bundeshaus - almanya - reutlingen | 30 | 26_wereldprotesten_switzerland_bundeshaus_almanya | |
| | | 27 | pandemics - sovereignty - amendments - geneva - plenary | 29 | 27_pandemics_sovereignty_amendments_geneva | |
| | | 28 | russiagate - dossier - counterintelligence - subpoenas - probed | 29 | 28_russiagate_dossier_counterintelligence_subpoenas | |
| | | 29 | musk - followers - savior - posted - 4chan | 28 | 29_musk_followers_savior_posted | |
| | | 30 | shungite - amethyst - purifier - waterzuiveraar - absorberen | 27 | 30_shungite_amethyst_purifier_waterzuiveraar | |
| | | 31 | bidenlaptopreport - theepochtimes - storyhttps - investigator - delaware | 26 | 31_bidenlaptopreport_theepochtimes_storyhttps_investigator | |
| | | 32 | londonofficialworldwiderally - worldwidedemonstration - saveourrightsuk - walkout - 24th | 25 | 32_londonofficialworldwiderally_worldwidedemonstration_saveourrightsuk_walkout | |
| | | 33 | zionists - israeli - israelites - gaza - ishmael | 24 | 33_zionists_israeli_israelites_gaza | |
| | | 34 | jury - hearings - korona - livestream - nuremberg | 24 | 34_jury_hearings_korona_livestream | |
| | | 35 | epstein - pedophiles - ghislaine - coconspirators - clinton | 22 | 35_epstein_pedophiles_ghislaine_coconspirators | |
| | |
| | </details> |
| |
|
| | ## Training hyperparameters |
| |
|
| | * calculate_probabilities: True |
| | * language: None |
| | * low_memory: False |
| | * min_topic_size: 10 |
| | * n_gram_range: (1, 1) |
| | * nr_topics: None |
| | * seed_topic_list: None |
| | * top_n_words: 10 |
| | * verbose: False |
| | * zeroshot_min_similarity: 0.7 |
| | * zeroshot_topic_list: None |
| | |
| | ## Framework versions |
| | |
| | * Numpy: 1.26.4 |
| | * HDBSCAN: 0.8.40 |
| | * UMAP: 0.5.7 |
| | * Pandas: 2.2.3 |
| | * Scikit-Learn: 1.5.2 |
| | * Sentence-transformers: 3.3.1 |
| | * Transformers: 4.46.3 |
| | * Numba: 0.60.0 |
| | * Plotly: 5.24.1 |
| | * Python: 3.10.12 |
| | |