Add BERTopic model
Browse files- README.md +214 -0
- config.json +16 -0
- ctfidf.safetensors +3 -0
- ctfidf_config.json +0 -0
- topic_embeddings.safetensors +3 -0
- topics.json +0 -0
README.md
ADDED
|
@@ -0,0 +1,214 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
|
| 2 |
+
---
|
| 3 |
+
tags:
|
| 4 |
+
- bertopic
|
| 5 |
+
library_name: bertopic
|
| 6 |
+
pipeline_tag: text-classification
|
| 7 |
+
---
|
| 8 |
+
|
| 9 |
+
# MARTINI_enrich_BERTopic_candlesinthenight
|
| 10 |
+
|
| 11 |
+
This is a [BERTopic](https://github.com/MaartenGr/BERTopic) model.
|
| 12 |
+
BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.
|
| 13 |
+
|
| 14 |
+
## Usage
|
| 15 |
+
|
| 16 |
+
To use this model, please install BERTopic:
|
| 17 |
+
|
| 18 |
+
```
|
| 19 |
+
pip install -U bertopic
|
| 20 |
+
```
|
| 21 |
+
|
| 22 |
+
You can use the model as follows:
|
| 23 |
+
|
| 24 |
+
```python
|
| 25 |
+
from bertopic import BERTopic
|
| 26 |
+
topic_model = BERTopic.load("AIDA-UPM/MARTINI_enrich_BERTopic_candlesinthenight")
|
| 27 |
+
|
| 28 |
+
topic_model.get_topic_info()
|
| 29 |
+
```
|
| 30 |
+
|
| 31 |
+
## Topic overview
|
| 32 |
+
|
| 33 |
+
* Number of topics: 145
|
| 34 |
+
* Number of training documents: 22892
|
| 35 |
+
|
| 36 |
+
<details>
|
| 37 |
+
<summary>Click here for an overview of all topics.</summary>
|
| 38 |
+
|
| 39 |
+
| Topic ID | Topic Keywords | Topic Frequency | Label |
|
| 40 |
+
|----------|----------------|-----------------|-------|
|
| 41 |
+
| -1 | donald - patriots - tweet - ballots - 2020 | 20 | -1_donald_patriots_tweet_ballots |
|
| 42 |
+
| 0 | desantis - romney - governors - desanctimonious - voted | 14610 | 0_desantis_romney_governors_desanctimonious |
|
| 43 |
+
| 1 | bodycam - shootings - antifa - officers - taser | 261 | 1_bodycam_shootings_antifa_officers |
|
| 44 |
+
| 2 | ukraine - donetsk - mariupol - zelenskyy - volodymyr | 244 | 2_ukraine_donetsk_mariupol_zelenskyy |
|
| 45 |
+
| 3 | faith - miracles - satan - bible - always | 226 | 3_faith_miracles_satan_bible |
|
| 46 |
+
| 4 | fbi - insurrectionists - pelosi - carlson - informant | 195 | 4_fbi_insurrectionists_pelosi_carlson |
|
| 47 |
+
| 5 | jpmorgan - epstein - billionaire - 290million - ghislaine | 170 | 5_jpmorgan_epstein_billionaire_290million |
|
| 48 |
+
| 6 | prayers - clots - healed - brother - icu | 166 | 6_prayers_clots_healed_brother |
|
| 49 |
+
| 7 | republicans - impeachment - mccarthy - committees - newhouse | 154 | 7_republicans_impeachment_mccarthy_committees |
|
| 50 |
+
| 8 | biden - joey - inauguration - donmeswitwomenofmericalessuwannagetthebenefit - dammmmm | 149 | 8_biden_joey_inauguration_donmeswitwomenofmericalessuwannagetthebenefit |
|
| 51 |
+
| 9 | georgia - lawyers - whistleblower - dismissed - freekyleusa | 145 | 9_georgia_lawyers_whistleblower_dismissed |
|
| 52 |
+
| 10 | flynn - prayer - archangel - michael - redstatetalkradio | 142 | 10_flynn_prayer_archangel_michael |
|
| 53 |
+
| 11 | hannity - newsmax - interviewed - president - klowdtv | 129 | 11_hannity_newsmax_interviewed_president |
|
| 54 |
+
| 12 | trumpo - mnuchin - lunatics - kristie - ahhhhhhhhh | 117 | 12_trumpo_mnuchin_lunatics_kristie |
|
| 55 |
+
| 13 | cnn - carlson - murdoch - timewarner - shareholder | 116 | 13_cnn_carlson_murdoch_timewarner |
|
| 56 |
+
| 14 | taiwan - pompeo - apec - xiaoping - weibo | 116 | 14_taiwan_pompeo_apec_xiaoping |
|
| 57 |
+
| 15 | tweets - fbi - misinformation - blacklists - slack | 112 | 15_tweets_fbi_misinformation_blacklists |
|
| 58 |
+
| 16 | georgenews - zoomgov - 2021 - teleprompter - 45pm | 109 | 16_georgenews_zoomgov_2021_teleprompter |
|
| 59 |
+
| 17 | illegals - deportation - migrant - reynosa - border | 101 | 17_illegals_deportation_migrant_reynosa |
|
| 60 |
+
| 18 | dan - scavino - retweeted - 17am - booooooooooooooooom | 96 | 18_dan_scavino_retweeted_17am |
|
| 61 |
+
| 19 | maricopa - ballots - auditors - subpoenaed - counties | 95 | 19_maricopa_ballots_auditors_subpoenaed |
|
| 62 |
+
| 20 | pfizer - vaccinate - cdc - 125742_s1_m5_crf_c4591001 - pages | 92 | 20_pfizer_vaccinate_cdc_125742_s1_m5_crf_c4591001 |
|
| 63 |
+
| 21 | clinton - mueller - spygate - dossier - danchenko | 89 | 21_clinton_mueller_spygate_dossier |
|
| 64 |
+
| 22 | lindelltv - mike - videolink - frankspeech - netstumbler | 88 | 22_lindelltv_mike_videolink_frankspeech |
|
| 65 |
+
| 23 | fauci - bioterrorism - coronaviruses - darpa - usaid | 87 | 23_fauci_bioterrorism_coronaviruses_darpa |
|
| 66 |
+
| 24 | unvaxed - nsw - lockdown - frederiksen - mandatory | 86 | 24_unvaxed_nsw_lockdown_frederiksen |
|
| 67 |
+
| 25 | trumps - prosecutors - indicted - travesty - manhattan | 84 | 25_trumps_prosecutors_indicted_travesty |
|
| 68 |
+
| 26 | arrests - polk - grady - pornography - abused | 83 | 26_arrests_polk_grady_pornography |
|
| 69 |
+
| 27 | qanon - qr - greatawakening - posts - decoding | 77 | 27_qanon_qr_greatawakening_posts |
|
| 70 |
+
| 28 | davos - climate - alarmist - dictatorships - globalized | 76 | 28_davos_climate_alarmist_dictatorships |
|
| 71 |
+
| 29 | night - blessed - everything - unshakable - hey | 74 | 29_night_blessed_everything_unshakable |
|
| 72 |
+
| 30 | chats - 10pm - voice - friday - live | 74 | 30_chats_10pm_voice_friday |
|
| 73 |
+
| 31 | freedom - allegiance - fightback - traitors - reagan | 73 | 31_freedom_allegiance_fightback_traitors |
|
| 74 |
+
| 32 | psalm - nebuchadnezzar - shadrach - isaiah - righteous | 71 | 32_psalm_nebuchadnezzar_shadrach_isaiah |
|
| 75 |
+
| 33 | republican - votes - mccormick - fetterman - leads | 71 | 33_republican_votes_mccormick_fetterman |
|
| 76 |
+
| 34 | donald - staten - 2021 - 45th - july | 71 | 34_donald_staten_2021_45th |
|
| 77 |
+
| 35 | qanon - traffickers - ballard - rescued - sound | 71 | 35_qanon_traffickers_ballard_rescued |
|
| 78 |
+
| 36 | georgia - ballots - falsified - raffensperger - scanned | 70 | 36_georgia_ballots_falsified_raffensperger |
|
| 79 |
+
| 37 | followers - mjtruth - retweets - hahahahahhah - rejoin | 69 | 37_followers_mjtruth_retweets_hahahahahhah |
|
| 80 |
+
| 38 | prayer - satan - heavenly - amen - forgive | 68 | 38_prayer_satan_heavenly_amen |
|
| 81 |
+
| 39 | vaers - myocarditis - unvaccinated - strokes - miscarriages | 68 | 39_vaers_myocarditis_unvaccinated_strokes |
|
| 82 |
+
| 40 | jamesokeefe - whistleblower - americanmuckraker - veritastips - peoject | 67 | 40_jamesokeefe_whistleblower_americanmuckraker_veritastips |
|
| 83 |
+
| 41 | fbi - melania - raided - documents - unsealed | 67 | 41_fbi_melania_raided_documents |
|
| 84 |
+
| 42 | gasoline - bidenflation - prices - utah - mile | 66 | 42_gasoline_bidenflation_prices_utah |
|
| 85 |
+
| 43 | ukraine - appropriations - billion - taxpayers - yellen | 65 | 43_ukraine_appropriations_billion_taxpayers |
|
| 86 |
+
| 44 | trudeau - tucker - thegatewaypundit - convoy - protesters | 64 | 44_trudeau_tucker_thegatewaypundit_convoy |
|
| 87 |
+
| 45 | derailment - spills - ohio - hazardous - propylene | 63 | 45_derailment_spills_ohio_hazardous |
|
| 88 |
+
| 46 | afghanistan - terrorists - amrullah - donald - surrender | 62 | 46_afghanistan_terrorists_amrullah_donald |
|
| 89 |
+
| 47 | musk - twitterverse - omega - shadowbanned - shatner | 62 | 47_musk_twitterverse_omega_shadowbanned |
|
| 90 |
+
| 48 | fetterman - oprah - impeaching - democrat - candidates | 62 | 48_fetterman_oprah_impeaching_democrat |
|
| 91 |
+
| 49 | ivermectin - hydroxychloroquine - remdesivir - antidote - fenbendazole | 61 | 49_ivermectin_hydroxychloroquine_remdesivir_antidote |
|
| 92 |
+
| 50 | blackpilling - everyone - channel - welcome - maqeveli | 61 | 50_blackpilling_everyone_channel_welcome |
|
| 93 |
+
| 51 | republicans - scgop - carolina - mckissick - chairman | 61 | 51_republicans_scgop_carolina_mckissick |
|
| 94 |
+
| 52 | bidenlaptopemails - clancy - hooker - scandal - kgb | 61 | 52_bidenlaptopemails_clancy_hooker_scandal |
|
| 95 |
+
| 53 | plandemic - unvaccinated - ivermectin - indoctrinating - audited | 60 | 53_plandemic_unvaccinated_ivermectin_indoctrinating |
|
| 96 |
+
| 54 | truthsocial - update - bots - wtaf - app | 60 | 54_truthsocial_update_bots_wtaf |
|
| 97 |
+
| 55 | vaccine - pfizer - lockdowns - pepematters - poisoned | 58 | 55_vaccine_pfizer_lockdowns_pepematters |
|
| 98 |
+
| 56 | vaccinated - fauci - scammers - diphtheria - whaaaaaaaaaaaaaaaaaaat | 57 | 56_vaccinated_fauci_scammers_diphtheria |
|
| 99 |
+
| 57 | spammers - banned - channel - replies - sorrrryyy | 57 | 57_spammers_banned_channel_replies |
|
| 100 |
+
| 58 | auditthevotepa - senators - harrisburg - petition - fax | 55 | 58_auditthevotepa_senators_harrisburg_petition |
|
| 101 |
+
| 59 | pelosi - sfpd - sleezeball - drunk - nudist | 55 | 59_pelosi_sfpd_sleezeball_drunk |
|
| 102 |
+
| 60 | wildfires - fema - evacuated - kelowna - hawaii | 53 | 60_wildfires_fema_evacuated_kelowna |
|
| 103 |
+
| 61 | rally - newsmax - 7pm - airing - rewatch | 52 | 61_rally_newsmax_7pm_airing |
|
| 104 |
+
| 62 | comey - snowden - federalized - nsa - soros | 52 | 62_comey_snowden_federalized_nsa |
|
| 105 |
+
| 63 | trumper - vivek - candidates - rnc - goviralfg | 50 | 63_trumper_vivek_candidates_rnc |
|
| 106 |
+
| 64 | arizona - gubernatorial - kelli - recount - judge | 48 | 64_arizona_gubernatorial_kelli_recount |
|
| 107 |
+
| 65 | hurricanes - rain - cirrus - manipulating - sarasota | 47 | 65_hurricanes_rain_cirrus_manipulating |
|
| 108 |
+
| 66 | night - faith - unshakeable - little - brightly | 45 | 66_night_faith_unshakeable_little |
|
| 109 |
+
| 67 | wisvote - ballots - milwaukee - tabulators - zuckerberg | 44 | 67_wisvote_ballots_milwaukee_tabulators |
|
| 110 |
+
| 68 | queen - diana - raven - sandringham - mi6 | 44 | 68_queen_diana_raven_sandringham |
|
| 111 |
+
| 69 | deltas - qdrop - july - dday - 2378713 | 44 | 69_deltas_qdrop_july_dday |
|
| 112 |
+
| 70 | sexualize - blowjobs - teachers - lgbtq - pornstache | 43 | 70_sexualize_blowjobs_teachers_lgbtq |
|
| 113 |
+
| 71 | voting - california - newsom - cheated - welllll | 41 | 71_voting_california_newsom_cheated |
|
| 114 |
+
| 72 | congressman - nebraska - endorsed - steve - carey | 40 | 72_congressman_nebraska_endorsed_steve |
|
| 115 |
+
| 73 | soooooo - reposting - normies - tried - tineye | 40 | 73_soooooo_reposting_normies_tried |
|
| 116 |
+
| 74 | mobilized - ng - preparedness - marines - fema | 40 | 74_mobilized_ng_preparedness_marines |
|
| 117 |
+
| 75 | vid - youtube - rewatched - posted - click | 39 | 75_vid_youtube_rewatched_posted |
|
| 118 |
+
| 76 | cryptocurrencies - hodl - officialsatoshi - bny - rupee | 39 | 76_cryptocurrencies_hodl_officialsatoshi_bny |
|
| 119 |
+
| 77 | jokingly - sjokeyjoke - laughed - fartwell - tickled | 39 | 77_jokingly_sjokeyjoke_laughed_fartwell |
|
| 120 |
+
| 78 | resigns - chancellor - parliamentary - brexit - javid | 39 | 78_resigns_chancellor_parliamentary_brexit |
|
| 121 |
+
| 79 | transgender - minors - wyoming - obamacare - vetoed | 38 | 79_transgender_minors_wyoming_obamacare |
|
| 122 |
+
| 80 | vaccine - mandate - scotusblog - employers - louisiana | 38 | 80_vaccine_mandate_scotusblog_employers |
|
| 123 |
+
| 81 | bolsonaro - brasilia - janeiro - moraes - peruvian | 37 | 81_bolsonaro_brasilia_janeiro_moraes |
|
| 124 |
+
| 82 | republicans - manchin - voted - earmarks - billion | 36 | 82_republicans_manchin_voted_earmarks |
|
| 125 |
+
| 83 | 1111 - synchronicity - uhhhhhhh - vibration - __hz | 35 | 83_1111_synchronicity_uhhhhhhh_vibration |
|
| 126 |
+
| 84 | george - posted - graphica - 15pm - 2022 | 35 | 84_george_posted_graphica_15pm |
|
| 127 |
+
| 85 | fluoridated - cadmium - doritos - microplastics - unhealthy | 35 | 85_fluoridated_cadmium_doritos_microplastics |
|
| 128 |
+
| 86 | hillary - comeback - evidently - 2016 - tweets | 34 | 86_hillary_comeback_evidently_2016 |
|
| 129 |
+
| 87 | vaccinations - eeoc - mandatory - exempt - hipaa | 34 | 87_vaccinations_eeoc_mandatory_exempt |
|
| 130 |
+
| 88 | georgia - kemp - vernon - deputy - audit | 33 | 88_georgia_kemp_vernon_deputy |
|
| 131 |
+
| 89 | mjtruthhomefront - covfefe - plandemic - 2022 - zelensky | 33 | 89_mjtruthhomefront_covfefe_plandemic_2022 |
|
| 132 |
+
| 90 | kanye - mkultra - celebrities - ricky - jews | 33 | 90_kanye_mkultra_celebrities_ricky |
|
| 133 |
+
| 91 | george - headlines - chats - wwg - 55pm | 33 | 91_george_headlines_chats_wwg |
|
| 134 |
+
| 92 | pandemics - h5n1 - deadlier - brussels - hopkins | 33 | 92_pandemics_h5n1_deadlier_brussels |
|
| 135 |
+
| 93 | stocked - kroger - status - shortage - farmers | 33 | 93_stocked_kroger_status_shortage |
|
| 136 |
+
| 94 | mcafee - deadman - countdown - killswitch - activated | 32 | 94_mcafee_deadman_countdown_killswitch |
|
| 137 |
+
| 95 | texas - illegals - governor - federalizes - standoff | 32 | 95_texas_illegals_governor_federalizes |
|
| 138 |
+
| 96 | stol - deleted - covfefe - impeached - 2023 | 31 | 96_stol_deleted_covfefe_impeached |
|
| 139 |
+
| 97 | hamas - gaza - israelis - ashkelon - missiles | 31 | 97_hamas_gaza_israelis_ashkelon |
|
| 140 |
+
| 98 | telegram - messaged - signal - deactivated - unsubscribe | 31 | 98_telegram_messaged_signal_deactivated |
|
| 141 |
+
| 99 | abortions - banning - mccorvey - enacted - ab2223 | 31 | 99_abortions_banning_mccorvey_enacted |
|
| 142 |
+
| 100 | gunmakers - militia - maryland - rulings - unconstitutional | 30 | 100_gunmakers_militia_maryland_rulings |
|
| 143 |
+
| 101 | pontiff - vatican - benedict - basilica - vigano | 30 | 101_pontiff_vatican_benedict_basilica |
|
| 144 |
+
| 102 | reelection - republican - wins - rubio - alabama | 29 | 102_reelection_republican_wins_rubio |
|
| 145 |
+
| 103 | norad - airship - drones - balloon - hypersonic | 29 | 103_norad_airship_drones_balloon |
|
| 146 |
+
| 104 | zuckerberg - whatapp - surveil - darpa - fakebook | 29 | 104_zuckerberg_whatapp_surveil_darpa |
|
| 147 |
+
| 105 | musk - shareholders - twatter - acquisition - vanguard | 29 | 105_musk_shareholders_twatter_acquisition |
|
| 148 |
+
| 106 | republicans - mcconnell - filibuster - senators - trillion | 29 | 106_republicans_mcconnell_filibuster_senators |
|
| 149 |
+
| 107 | borders - tijuana - immigrant - smuggling - criminals | 28 | 107_borders_tijuana_immigrant_smuggling |
|
| 150 |
+
| 108 | memorial - veterans - honor - blessings - liberty | 28 | 108_memorial_veterans_honor_blessings |
|
| 151 |
+
| 109 | mj - illegals - netanyahu - 2023 - happenings | 28 | 109_mj_illegals_netanyahu_2023 |
|
| 152 |
+
| 110 | trafficked - whistleblower - adoptions - maltreatment - bama | 28 | 110_trafficked_whistleblower_adoptions_maltreatment |
|
| 153 |
+
| 111 | lewinsky - whore - donna - carrol - defamed | 27 | 111_lewinsky_whore_donna_carrol |
|
| 154 |
+
| 112 | pelosi - unconstitutionally - subpoena - january - newsflash | 27 | 112_pelosi_unconstitutionally_subpoena_january |
|
| 155 |
+
| 113 | bidens - 40million - laundered - deposited - llc | 27 | 113_bidens_40million_laundered_deposited |
|
| 156 |
+
| 114 | communist - khrushchev - goals - disarmament - infiltrate | 27 | 114_communist_khrushchev_goals_disarmament |
|
| 157 |
+
| 115 | walmart - boycotting - businesses - blackrock - dunkin | 27 | 115_walmart_boycotting_businesses_blackrock |
|
| 158 |
+
| 116 | hindenburg - hitler - brainwashing - wolff - lifeboat | 26 | 116_hindenburg_hitler_brainwashing_wolff |
|
| 159 |
+
| 117 | bullion - dollar - rubles - sterling - revalued | 26 | 117_bullion_dollar_rubles_sterling |
|
| 160 |
+
| 118 | goodnight - blessed - 4am - sunday - birthday | 26 | 118_goodnight_blessed_4am_sunday |
|
| 161 |
+
| 119 | hey - tomorrow - busy - missed - hopefully | 26 | 119_hey_tomorrow_busy_missed |
|
| 162 |
+
| 120 | suez - maersk - tugboats - blockade - container | 25 | 120_suez_maersk_tugboats_blockade |
|
| 163 |
+
| 121 | swimmers - transgender - athlete - ncaa - disqualified | 25 | 121_swimmers_transgender_athlete_ncaa |
|
| 164 |
+
| 122 | actblue - donations - earmarks - laundering - schwarzman | 25 | 122_actblue_donations_earmarks_laundering |
|
| 165 |
+
| 123 | planecrash - flightradar - bomber - northrop - f35 | 25 | 123_planecrash_flightradar_bomber_northrop |
|
| 166 |
+
| 124 | vibes - good - cutest - gif - smiling | 25 | 124_vibes_good_cutest_gif |
|
| 167 |
+
| 125 | night - faith - alright - unshakable - woooooooooo | 24 | 125_night_faith_alright_unshakable |
|
| 168 |
+
| 126 | maricopa - auditors - tampered - cyber - password | 24 | 126_maricopa_auditors_tampered_cyber |
|
| 169 |
+
| 127 | linwoodspeakstruth - redeemed - pastor - david - caroline | 24 | 127_linwoodspeakstruth_redeemed_pastor_david |
|
| 170 |
+
| 128 | lockdowns - beijing - sichuan - luzhou - urumqi | 24 | 128_lockdowns_beijing_sichuan_luzhou |
|
| 171 |
+
| 129 | blessings - prayer - thankful - verses - sunday | 24 | 129_blessings_prayer_thankful_verses |
|
| 172 |
+
| 130 | spacex - satellites - missile - huntsville - cyberattacks | 24 | 130_spacex_satellites_missile_huntsville |
|
| 173 |
+
| 131 | defcon - arrest - inauguration - broadcast - resignations | 23 | 131_defcon_arrest_inauguration_broadcast |
|
| 174 |
+
| 132 | george - chats - posted - unshakable - 2022 | 23 | 132_george_chats_posted_unshakable |
|
| 175 |
+
| 133 | speaker - gop - livestream - delegates - 5pm | 23 | 133_speaker_gop_livestream_delegates |
|
| 176 |
+
| 134 | trumpers - gabbard - mike - nominee - overturned | 23 | 134_trumpers_gabbard_mike_nominee |
|
| 177 |
+
| 135 | georgetb - today - images - 48pm - 2094 | 22 | 135_georgetb_today_images_48pm |
|
| 178 |
+
| 136 | ufos - alienleaks - mothership - nordstream - reconnaissance | 22 | 136_ufos_alienleaks_mothership_nordstream |
|
| 179 |
+
| 137 | handsome - loves - wooooo - husband - patriot | 21 | 137_handsome_loves_wooooo_husband |
|
| 180 |
+
| 138 | ncov - swabs - test - contagious - diagnosed | 21 | 138_ncov_swabs_test_contagious |
|
| 181 |
+
| 139 | explosion - refinery - pemex - louisiana - hazmat | 21 | 139_explosion_refinery_pemex_louisiana |
|
| 182 |
+
| 140 | died - myocarditis - vaccinated - stroke - moussa | 21 | 140_died_myocarditis_vaccinated_stroke |
|
| 183 |
+
| 141 | ballots - pennsylvania - countywide - scanned - undelivered | 21 | 141_ballots_pennsylvania_countywide_scanned |
|
| 184 |
+
| 142 | wikileaks - pompeo - extradited - julian - assassinate | 20 | 142_wikileaks_pompeo_extradited_julian |
|
| 185 |
+
| 143 | milley - mattis - repdonaldspress - soleimani - treasonous | 20 | 143_milley_mattis_repdonaldspress_soleimani |
|
| 186 |
+
|
| 187 |
+
</details>
|
| 188 |
+
|
| 189 |
+
## Training hyperparameters
|
| 190 |
+
|
| 191 |
+
* calculate_probabilities: True
|
| 192 |
+
* language: None
|
| 193 |
+
* low_memory: False
|
| 194 |
+
* min_topic_size: 10
|
| 195 |
+
* n_gram_range: (1, 1)
|
| 196 |
+
* nr_topics: None
|
| 197 |
+
* seed_topic_list: None
|
| 198 |
+
* top_n_words: 10
|
| 199 |
+
* verbose: False
|
| 200 |
+
* zeroshot_min_similarity: 0.7
|
| 201 |
+
* zeroshot_topic_list: None
|
| 202 |
+
|
| 203 |
+
## Framework versions
|
| 204 |
+
|
| 205 |
+
* Numpy: 1.26.4
|
| 206 |
+
* HDBSCAN: 0.8.40
|
| 207 |
+
* UMAP: 0.5.7
|
| 208 |
+
* Pandas: 2.2.3
|
| 209 |
+
* Scikit-Learn: 1.5.2
|
| 210 |
+
* Sentence-transformers: 3.3.1
|
| 211 |
+
* Transformers: 4.46.3
|
| 212 |
+
* Numba: 0.60.0
|
| 213 |
+
* Plotly: 5.24.1
|
| 214 |
+
* Python: 3.10.12
|
config.json
ADDED
|
@@ -0,0 +1,16 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"calculate_probabilities": true,
|
| 3 |
+
"language": null,
|
| 4 |
+
"low_memory": false,
|
| 5 |
+
"min_topic_size": 10,
|
| 6 |
+
"n_gram_range": [
|
| 7 |
+
1,
|
| 8 |
+
1
|
| 9 |
+
],
|
| 10 |
+
"nr_topics": null,
|
| 11 |
+
"seed_topic_list": null,
|
| 12 |
+
"top_n_words": 10,
|
| 13 |
+
"verbose": false,
|
| 14 |
+
"zeroshot_min_similarity": 0.7,
|
| 15 |
+
"zeroshot_topic_list": null
|
| 16 |
+
}
|
ctfidf.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a1370e9a5bff54be83fa077ddb0e2da01d2a52762f4b24758144747c0b0c3baf
|
| 3 |
+
size 2277740
|
ctfidf_config.json
ADDED
|
The diff for this file is too large to render.
See raw diff
|
|
|
topic_embeddings.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:545da3ecaad39207cc3d481366f31cf8ccb7a0f496ace4132a13e25e4225eedf
|
| 3 |
+
size 594016
|
topics.json
ADDED
|
The diff for this file is too large to render.
See raw diff
|
|
|