CCRss commited on
Commit
540c906
·
verified ·
1 Parent(s): 2e42b0c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +47 -0
README.md CHANGED
@@ -8,8 +8,37 @@ datasets:
8
  - CCRss/arxiv_papers_cs
9
  ---
10
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
11
  # Top2Vec Scientific Texts Model
12
 
 
 
13
  This repository hosts the `top2vec_scientific_texts` model, a specialized Top2Vec model trained on scientific texts for topic modeling and semantic search.
14
 
15
  ## Model Overview
@@ -86,6 +115,24 @@ This graph shows the trend of interest in the use of UAVs in disaster and emerge
86
 
87
  ### Key Metrics Table
88
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
89
  ## Contributions
90
 
91
  We welcome contributions to the top2vec_scientific_texts model. If you have suggestions, improvements, or encounter any issues, please feel free to open an issue or submit a pull request.
 
8
  - CCRss/arxiv_papers_cs
9
  ---
10
 
11
+ <!DOCTYPE html>
12
+ <html>
13
+ <head>
14
+ <meta charset="UTF-8">
15
+ <meta name="viewport" content="width=device-width, initial-scale=1.0">
16
+ <meta http-equiv="X-UA-Compatible" content="ie=edge">
17
+ <title>Markmap</title>
18
+ <style>
19
+ * {
20
+ margin: 0;
21
+ padding: 0;
22
+ }
23
+ #mindmap {
24
+ display: block;
25
+ width: 100vw;
26
+ height: 100vh;
27
+ }
28
+ </style>
29
+ <link rel="stylesheet" href="https://cdn.jsdelivr.net/npm/markmap-toolbar@0.16.0/dist/style.css">
30
+ </head>
31
+ <body>
32
+ <svg id="mindmap"></svg>
33
+ <script src="https://cdn.jsdelivr.net/npm/d3@7.8.5/dist/d3.min.js"></script><script src="https://cdn.jsdelivr.net/npm/markmap-view@0.16.0/dist/browser/index.js"></script><script src="https://cdn.jsdelivr.net/npm/markmap-toolbar@0.16.0/dist/index.js"></script><script>(()=>{setTimeout(()=>{const{markmap:H,mm:ae}=window,W=new H.Toolbar;W.attach(ae);const we=W.render();we.setAttribute("style","position:absolute;bottom:20px;right:20px"),document.body.append(we)})})()</script><script>((o,T,c,r)=>{const g=o();window.mm=g.Markmap.create("svg#mindmap",(T||g.deriveOptions)(r),c)})(()=>window.markmap,null,{"content":"Top2Vec Model for Scientific Texts","children":[{"content":"Overview","children":[{"content":"<strong>Purpose:</strong> Analyze scientific texts for topic modeling and semantic search.","children":[],"payload":{"lines":"3,4"}},{"content":"<strong>Model:</strong> Top2Vec with Universal Sentence Encoder.","children":[],"payload":{"lines":"4,5"}},{"content":"<strong>Domain:</strong> Scientific literature, particularly UAV applications in disaster and emergency situations.","children":[],"payload":{"lines":"5,7"}}],"payload":{"lines":"2,3"}},{"content":"Installation","children":[{"content":"<code>pip install top2vec</code>","children":[],"payload":{"lines":"8,9"}},{"content":"<code>pip install tensorflow==2.8.0</code>","children":[],"payload":{"lines":"9,10"}},{"content":"<code>pip install tensorflow-probability==0.16.0</code>","children":[],"payload":{"lines":"10,12"}}],"payload":{"lines":"7,8"}},{"content":"Usage","children":[{"content":"<strong>Load Documents:</strong>","children":[{"content":"Source: arXiv articles in Computer Science.","children":[],"payload":{"lines":"14,15"}},{"content":"Time Frame: 2010 to 2023.","children":[],"payload":{"lines":"15,16"}}],"payload":{"lines":"13,16"}},{"content":"<strong>Initialize Model:</strong>","children":[{"content":"<code>Top2Vec(documents=docs, embedding_model='universal-sentence-encoder')</code>","children":[],"payload":{"lines":"17,18"}}],"payload":{"lines":"16,18"}},{"content":"<strong>Model Training:</strong>","children":[{"content":"UMAP for dimensionality reduction.","children":[],"payload":{"lines":"19,20"}},{"content":"HDBSCAN for clustering.","children":[],"payload":{"lines":"20,21"}}],"payload":{"lines":"18,21"}},{"content":"<strong>Save Model:</strong>","children":[{"content":"<code>model.save('top2vec_scientific_texts_model')</code>","children":[],"payload":{"lines":"22,24"}}],"payload":{"lines":"21,24"}}],"payload":{"lines":"12,13"}},{"content":"Thematic Groups Analysis","children":[{"content":"<strong>Example Group:</strong> \"UAV in Disasters and Emergency\"","children":[{"content":"<strong>Trend Analysis:</strong> Visualize interest over time.","children":[],"payload":{"lines":"26,27"}},{"content":"<strong>Key Metrics:</strong> Number of publications, growth rate, relative growth.","children":[],"payload":{"lines":"27,29"}}],"payload":{"lines":"25,29"}}],"payload":{"lines":"24,25"}},{"content":"Use Cases","children":[{"content":"<strong>Topic Discovery:</strong> Uncover main topics within scientific texts.","children":[],"payload":{"lines":"30,31"}},{"content":"<strong>Semantic Search:</strong> Find similar documents based on semantic content.","children":[],"payload":{"lines":"31,32"}},{"content":"<strong>Trend Analysis:</strong> Study the evolution of topics over time.","children":[],"payload":{"lines":"32,34"}}],"payload":{"lines":"29,30"}},{"content":"Data","children":[{"content":"<strong>Dataset:</strong> <a href=\"https://huggingface.co/datasets/CCRss/arxiv_papers_cs\">arxiv_papers_cs</a>","children":[{"content":"Contains scientific abstracts from arXiv.","children":[],"payload":{"lines":"36,38"}}],"payload":{"lines":"35,38"}}],"payload":{"lines":"34,35"}},{"content":"Examples","children":[{"content":"<strong>Trend Analysis Graph:</strong> Visualization of interest in \"UAV in Disasters and Emergency\".","children":[],"payload":{"lines":"39,40"}},{"content":"<strong>Key Metrics Table:</strong> Summary of publications and growth metrics for the thematic group.","children":[],"payload":{"lines":"40,42"}}],"payload":{"lines":"38,39"}},{"content":"Contributions","children":[{"content":"<strong>Feedback:</strong> Suggestions and improvements are welcome.","children":[],"payload":{"lines":"43,44"}},{"content":"<strong>Issues:</strong> Please report any issues or bugs.","children":[],"payload":{"lines":"44,46"}}],"payload":{"lines":"42,43"}}],"payload":{"lines":"0,1"}},null)</script>
34
+ </body>
35
+ </html>
36
+
37
+
38
  # Top2Vec Scientific Texts Model
39
 
40
+ ![MindMap](disasters_and_emergency_plot.png)
41
+
42
  This repository hosts the `top2vec_scientific_texts` model, a specialized Top2Vec model trained on scientific texts for topic modeling and semantic search.
43
 
44
  ## Model Overview
 
115
 
116
  ### Key Metrics Table
117
 
118
+ Analysis for Thematic Group: Disasters & Emergency
119
+ | Year | Number of Publications | Change in Number of Publications | Relative Growth | Growth Acceleration |
120
+ |-------:|-------------------------:|-----------------------------------:|:------------------|----------------------:|
121
+ | 2010 | 19 | 0 | 0.0% | nan |
122
+ | 2011 | 15 | -4 | -21.05% | -4 |
123
+ | 2012 | 28 | 13 | 86.67% | 17 |
124
+ | 2013 | 38 | 10 | 35.71% | -3 |
125
+ | 2014 | 28 | -10 | -26.32% | -20 |
126
+ | 2015 | 47 | 19 | 67.86% | 29 |
127
+ | 2016 | 63 | 16 | 34.04% | -3 |
128
+ | 2017 | 94 | 31 | 49.21% | 15 |
129
+ | 2018 | 173 | 79 | 84.04% | 48 |
130
+ | 2019 | 266 | 93 | 53.76% | 14 |
131
+ | 2020 | 337 | 71 | 26.69% | -22 |
132
+ | 2021 | 380 | 43 | 12.76% | -28 |
133
+ | 2022 | 453 | 73 | 19.21% | 30 |
134
+ | 2023 | 509 | 56 | 12.36% | -17 |
135
+
136
  ## Contributions
137
 
138
  We welcome contributions to the top2vec_scientific_texts model. If you have suggestions, improvements, or encounter any issues, please feel free to open an issue or submit a pull request.