File size: 5,939 Bytes
19b102a 547860b 19b102a |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 |
site_name: BERTopic
repo_url: https://github.com/MaartenGr/BERTopic
site_url: https://maartengr.github.io/BERTopic/
site_description: Leveraging BERT and a class-based TF-IDF to create easily interpretable topics.
site_author: Maarten P. Grootendorst
use_directory_urls: false
extra_css:
- stylesheets/extra.css
nav:
- Home: index.md
- The Algorithm: algorithm/algorithm.md
- Getting Started:
- Quick Start: getting_started/quickstart/quickstart.md
- Serialization: getting_started/serialization/serialization.md
- Search Topics: getting_started/search/search.md
- Best Practices: getting_started/best_practices/best_practices.md
- In-depth:
- Visualizations:
- Topics: getting_started/visualization/visualize_topics.md
- Documents: getting_started/visualization/visualize_documents.md
- Terms: getting_started/visualization/visualize_terms.md
- Hierarchy: getting_started/visualization/visualize_hierarchy.md
- Update Topics:
- Topic Reduction: getting_started/topicreduction/topicreduction.md
- Update Topic Representations: getting_started/topicrepresentation/topicrepresentation.md
- Outlier reduction: getting_started/outlier_reduction/outlier_reduction.md
- Parameter tuning: getting_started/parameter tuning/parametertuning.md
- Tips & Tricks: getting_started/tips_and_tricks/tips_and_tricks.md
- Sub-models:
- 1. Embeddings: getting_started/embeddings/embeddings.md
- 2. Dimensionality Reduction: getting_started/dim_reduction/dim_reduction.md
- 3. Clustering: getting_started/clustering/clustering.md
- 4. Vectorizers: getting_started/vectorizers/vectorizers.md
- 5. c-TF-IDF: getting_started/ctfidf/ctfidf.md
- 6. Fine-tune Topics:
- 6A. Representation Models: getting_started/representation/representation.md
- 6B. LLM & Generative AI: getting_started/representation/llm.md
- 6C. Multiple Representations: getting_started/multiaspect/multiaspect.md
- Variations:
- Dynamic Topic Modeling: getting_started/topicsovertime/topicsovertime.md
- Hierarchical Topic Modeling: getting_started/hierarchicaltopics/hierarchicaltopics.md
- Multimodal Topic Modeling: getting_started/multimodal/multimodal.md
- Online Topic Modeling: getting_started/online/online.md
- Merge Multiple Models: getting_started/merge/merge.md
- (semi)-supervised:
- Semi-supervised Topic Modeling: getting_started/semisupervised/semisupervised.md
- Supervised Topic Modeling: getting_started/supervised/supervised.md
- Manual Topic Modeling: getting_started/manual/manual.md
- Guided Topic Modeling: getting_started/guided/guided.md
- Zero-shot Topic Modeling: getting_started/zeroshot/zeroshot.md
- Topic Distributions: getting_started/distribution/distribution.md
- Topics per Class: getting_started/topicsperclass/topicsperclass.md
- Seed Words: getting_started/seed_words/seed_words.md
- FAQ: faq.md
- Use Cases: usecases.md
- API:
- BERTopic: api/bertopic.md
- Sub-models:
- Backends:
- Base: api/backends/base.md
- Word Doc: api/backends/word_doc.md
- OpenAI: api/backends/openai.md
- Cohere: api/backends/cohere.md
- Dimensionality Reduction:
- Base: api/dimensionality/base.md
- Clustering:
- Base: api/cluster/base.md
- Vectorizers:
- cTFIDF: api/ctfidf.md
- OnlineCountVectorizer: api/onlinecv.md
- Topic Representation:
- Base: api/representation/base.md
- MaximalMarginalRelevance: api/representation/mmr.md
- KeyBERT: api/representation/keybert.md
- PartOfSpeech: api/representation/pos.md
- Text Generation:
- 🤗 Transformers: api/representation/generation.md
- LangChain: api/representation/langchain.md
- Cohere: api/representation/cohere.md
- OpenAI: api/representation/openai.md
- Zero-shot Classification: api/representation/zeroshot.md
- Plotting:
- Barchart: api/plotting/barchart.md
- Documents: api/plotting/documents.md
- Documents with DataMapPlot: api/plotting/document_datamap.md
- DTM: api/plotting/dtm.md
- Hierarchical documents: api/plotting/hierarchical_documents.md
- Hierarchical topics: api/plotting/hierarchy.md
- Distribution: api/plotting/distribution.md
- Heatmap: api/plotting/heatmap.md
- Term Scores: api/plotting/term.md
- Topics: api/plotting/topics.md
- Topics per Class: api/plotting/topics_per_class.md
- Changelog: changelog.md
plugins:
- mkdocstrings:
watch:
- bertopic
- search
- social
copyright: Copyright © 2023 Maintained by <a href="https://github.com/MaartenGr">Maarten</a>.
theme:
custom_dir: images/
name: material
icon:
logo: material/library
font:
text: Ubuntu
code: Ubuntu Mono
favicon: icon.png
logo: img/icon.png
feature:
tabs: true
features:
- navigation.tabs
- navigation.sections
- navigation.instant
- navigation.top
- navigation.tracking
- toc.follow
- content.code.copy
palette:
- media: "(prefers-color-scheme: light)"
primary: custom
scheme: black
toggle:
icon: material/weather-sunny
name: Switch to dark mode
- media: "(prefers-color-scheme: dark)"
scheme: slate
primary: black
toggle:
icon: material/weather-night
name: Switch to light mode
markdown_extensions:
- admonition
- md_in_html
- pymdownx.details
- pymdownx.highlight
- pymdownx.superfences
- pymdownx.snippets
- toc:
permalink: true
|