MARTINI_enrich_BERTopic_orlaredchan

This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.

Usage

To use this model, please install BERTopic:

pip install -U bertopic

You can use the model as follows:

from bertopic import BERTopic
topic_model = BERTopic.load("AIDA-UPM/MARTINI_enrich_BERTopic_orlaredchan")

topic_model.get_topic_info()
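Once the model is loaded, a single topic can be inspected with `get_topic`, which returns (word, score) pairs for a topic ID, or `False` if the ID does not exist. The sketch below wraps that call; the helper name `top_keywords` is hypothetical and not part of BERTopic.

```python
def top_keywords(topic_model, topic_id, n=5):
    """Return the n highest-ranked keywords for one topic.

    `top_keywords` is an illustrative helper, not a BERTopic function.
    It relies only on `get_topic`, which returns a list of
    (word, score) pairs, or False when the topic ID is unknown.
    """
    pairs = topic_model.get_topic(topic_id)
    if not pairs:
        return []
    return [word for word, _score in pairs[:n]]


# Usage, assuming `topic_model` was loaded as shown above:
# print(top_keywords(topic_model, 12))
```

Returning an empty list for an unknown ID keeps the helper safe to call in a loop over arbitrary topic IDs.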

Topic overview

  • Number of topics: 30
  • Number of training documents: 4472
| Topic ID | Topic Keywords | Topic Frequency | Label |
|----------|----------------|-----------------|-------|
| -1 | migrants - gardai - protest - covid - ngos | 21 | -1_migrants_gardai_protest_covid |
| 0 | illegals - refugee - irishtimes - nigeria - ukrainians | 2925 | 0_illegals_refugee_irishtimes_nigeria |
| 1 | irishmen - ulster - nationalist - cailleach - surrender | 164 | 1_irishmen_ulster_nationalist_cailleach |
| 2 | abduction - kilkenny - daughter - chased - garda | 114 | 2_abduction_kilkenny_daughter_chased |
| 3 | raped - accused - convictions - jailed - pakistan | 99 | 3_raped_accused_convictions_jailed |
| 4 | blanchardstown - drogheda - gangs - carraigaline - stabbed | 98 | 4_blanchardstown_drogheda_gangs_carraigaline |
| 5 | africans - slavery - irishness - shelbourne - majekodunmi | 89 | 5_africans_slavery_irishness_shelbourne |
| 6 | fraudsters - scammed - nigeria - laundering - suspect | 88 | 6_fraudsters_scammed_nigeria_laundering |
| 7 | kildarenow - finglas - accommodation - protested - blessington | 73 | 7_kildarenow_finglas_accommodation_protested |
| 8 | nationalities - census - scotland - percentage - majority | 68 | 8_nationalities_census_scotland_percentage |
| 9 | censorship - criminalise - legislation - bunreacht - mcguirk | 65 | 9_censorship_criminalise_legislation_bunreacht |
| 10 | quarantine - superspreading - tanaiste - airports - britain | 57 | 10_quarantine_superspreading_tanaiste_airports |
| 11 | migrants - european - lukashenko - poland - merkel | 51 | 11_migrants_european_lukashenko_poland |
| 12 | oats - silage - farmers - corrib - imported | 48 | 12_oats_silage_farmers_corrib |
| 13 | transgenderism - pedophiles - heteronormativity - nhs - teachers | 46 | 13_transgenderism_pedophiles_heteronormativity_nhs |
| 14 | vaccine - pfizer - gsk - snopes - waterford | 46 | 14_vaccine_pfizer_gsk_snopes |
| 15 | france - zemmour - aoussaoui - islamist - rioting | 42 | 15_france_zemmour_aoussaoui_islamist |
| 16 | murder - tullamore - jozef - ashling - jurors | 40 | 16_murder_tullamore_jozef_ashling |
| 17 | irelandisfull - rents - migrants - ronan - soros | 36 | 17_irelandisfull_rents_migrants_ronan |
| 18 | stabbings - gaelscoil - suspected - algerian - parnell | 33 | 18_stabbings_gaelscoil_suspected_algerian |
| 19 | freeyoutubeapp - telegram - bigtaboochannel - uploaded - subscribers | 32 | 19_freeyoutubeapp_telegram_bigtaboochannel_uploaded |
| 20 | paedophile - tatchell - lgbt - cunnilingus - advocated | 31 | 20_paedophile_tatchell_lgbt_cunnilingus |
| 21 | volunteersforukraine - zelensky - putin - libya - nazification | 30 | 21_volunteersforukraine_zelensky_putin_libya |
| 22 | asylum - kinnegad - committee - screened - hotels | 30 | 22_asylum_kinnegad_committee_screened |
| 23 | protestors - ballsbridge - roadblocks - constituency - bohemians | 28 | 23_protestors_ballsbridge_roadblocks_constituency |
| 24 | antifa - protesters - kilraine - idiots - corcoran | 25 | 24_antifa_protesters_kilraine_idiots |
| 25 | birthright - deported - passport - niall - amendment | 25 | 25_birthright_deported_passport_niall |
| 26 | shitlibs - retweeters - donegaldaily - hashtags - campaigned | 24 | 26_shitlibs_retweeters_donegaldaily_hashtags |
| 27 | gypsies - slovakia - begging - robbing - discrimination | 22 | 27_gypsies_slovakia_begging_robbing |
| 28 | mosques - clonskeagh - impic_ie - qadri - extremists | 22 | 28_mosques_clonskeagh_impic_ie_qadri |
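The frequencies in the table can be cross-checked against the reported number of training documents, and they show how unevenly the documents are distributed. The following sketch copies the counts from the table; the names `topic_frequency` and `topic_share` are illustrative only.

```python
# Document counts per topic ID, copied from the table above.
topic_frequency = {
    -1: 21, 0: 2925, 1: 164, 2: 114, 3: 99, 4: 98, 5: 89, 6: 88,
    7: 73, 8: 68, 9: 65, 10: 57, 11: 51, 12: 48, 13: 46, 14: 46,
    15: 42, 16: 40, 17: 36, 18: 33, 19: 32, 20: 31, 21: 30, 22: 30,
    23: 28, 24: 25, 25: 25, 26: 24, 27: 22, 28: 22,
}

total_documents = sum(topic_frequency.values())


def topic_share(topic_id):
    """Fraction of all training documents assigned to one topic."""
    return topic_frequency[topic_id] / total_documents


print(len(topic_frequency))      # 30 topics, counting the outlier topic -1
print(total_documents)           # 4472, matching the training document count
print(f"{topic_share(0):.1%}")   # 65.4%
```

Topic 0 alone covers roughly two thirds of the corpus, so the remaining 28 regular topics and the outlier topic -1 share the other third.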

Training hyperparameters

  • calculate_probabilities: True
  • language: None
  • low_memory: False
  • min_topic_size: 10
  • n_gram_range: (1, 1)
  • nr_topics: None
  • seed_topic_list: None
  • top_n_words: 10
  • verbose: False
  • zeroshot_min_similarity: 0.7
  • zeroshot_topic_list: None
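The values above correspond to keyword arguments of the `BERTopic` constructor. As a rough sketch, they could be passed back in as shown below to create an untrained model with the same settings. This is not the authors' training script: the embedding model, the UMAP and HDBSCAN settings, and the training data are not listed in this card, and `language: None` likely indicates that a custom embedding model was supplied, so one would have to be provided before fitting. The function name `build_untrained_model` is hypothetical.

```python
# Hyperparameters exactly as listed above.
hyperparameters = {
    "calculate_probabilities": True,
    "language": None,
    "low_memory": False,
    "min_topic_size": 10,
    "n_gram_range": (1, 1),
    "nr_topics": None,
    "seed_topic_list": None,
    "top_n_words": 10,
    "verbose": False,
    "zeroshot_min_similarity": 0.7,
    "zeroshot_topic_list": None,
}


def build_untrained_model(embedding_model=None):
    """Create an unfitted BERTopic instance with the listed settings.

    `embedding_model` is an assumption: the card does not say which
    embedding model was used, so the caller has to choose one.
    """
    from bertopic import BERTopic  # imported lazily; requires `pip install bertopic`

    return BERTopic(embedding_model=embedding_model, **hyperparameters)
```

The zero-shot arguments require a BERTopic release that supports zero-shot topic modeling; on older releases they can be dropped from the dictionary.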

Framework versions

  • Numpy: 1.26.4
  • HDBSCAN: 0.8.40
  • UMAP: 0.5.7
  • Pandas: 2.2.3
  • Scikit-Learn: 1.5.2
  • Sentence-transformers: 3.3.1
  • Transformers: 4.46.3
  • Numba: 0.60.0
  • Plotly: 5.24.1
  • Python: 3.10.12
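
Loading a saved model can be sensitive to library versions, so it may help to compare the local environment with the versions listed above. The sketch below uses only the standard library; the mapping from the names in the list to PyPI distribution names (for example `umap-learn` for UMAP) is an assumption, and `version_mismatches` is an illustrative helper.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions listed above, keyed by the assumed PyPI distribution name.
expected_versions = {
    "numpy": "1.26.4",
    "hdbscan": "0.8.40",
    "umap-learn": "0.5.7",
    "pandas": "2.2.3",
    "scikit-learn": "1.5.2",
    "sentence-transformers": "3.3.1",
    "transformers": "4.46.3",
    "numba": "0.60.0",
    "plotly": "5.24.1",
}


def version_mismatches(expected):
    """Map each package whose installed version differs to that version.

    Packages that are not installed are reported with the value None.
    """
    mismatches = {}
    for name, wanted in expected.items():
        try:
            installed = version(name)
        except PackageNotFoundError:
            installed = None
        if installed != wanted:
            mismatches[name] = installed
    return mismatches


for name, installed in version_mismatches(expected_versions).items():
    print(f"{name}: expected {expected_versions[name]}, found {installed}")
```

A mismatch does not necessarily mean the model will fail to load; it only flags where the environment differs from the one used for training.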