NealCaren committed on
Commit b73ea8d · 1 Parent(s): 87f3524

Update app.py

Files changed (1)
  1. app.py +4 -4
app.py CHANGED
@@ -21,12 +21,12 @@ device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
  import pandas as pd
  st.title('Sociology Paragraph Search')
 
- st.write('This page is a work-in-progress that allows you to search through articles recently published in a few sociology journals and retrieve the most relevant paragraphs. ')
+ st.write('This project is a work-in-progress that searches through articles recently published in a few sociology journals and retrieves the most relevant paragraphs.')
 
  st.markdown('''Notes:
- * To get the best results, search like you are using Google. My best luck comes from phrases, such as "social movements and public opinion", "inequality in latin america", "race color skin tone measurement", "audit study experiment gender", "crenshaw intersectionality" or "logistic regression or linear probability model". You can even use questions, such as "what is a topic model?" or "What is the dual process model?"
- * The dataset currently includes only article published since 2016 in Social Forces, Social Problems, Sociology of Race and Ethnicity, Gender and Society, Socius, JHSB, and the American Sociological Review (approximately 100K paragraphs from 2K articles).
- * The most relevant paragarph to your search is returned first, along with up to four other related paragraphs from that article.
+ * To get the best results, search like you are using Google. My best luck comes from phrases like "social movements and public opinion", "inequality in latin america", "race color skin tone measurement", "audit study experiment gender", "crenshaw intersectionality", or "logistic regression or linear probability model". You can also use questions like "what is a topic model?" or "What is the dual process model?"
+ * The dataset currently includes sociology articles from Social Forces, Social Problems, Sociology of Race and Ethnicity, Gender and Society, Socius, JHSB, and the American Sociological Review published in the last five years, totaling approximately 100,000 paragraphs from 2,000 articles.
+ * The most relevant paragraph to your search is returned first, along with up to four other related paragraphs from that article.
  * The most relevant sentence within each paragraph, as determined by math, is bolded.
  * Behind the scenes, the semantic search uses [text embeddings](https://www.sbert.net) with a [retrieve & re-rank](https://colab.research.google.com/github/UKPLab/sentence-transformers/blob/master/examples/applications/retrieve_rerank/retrieve_rerank_simple_wikipedia.ipynb) process to find the best matches.
  * Let [me](mailto:neal.caren@unc.edu) know what you think.
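
For readers unfamiliar with the retrieve & re-rank process mentioned in the notes, here is a minimal, self-contained sketch of the two-stage idea. It does not reproduce `app.py`. The bag-of-words `embed` and the bigram-overlap `rerank_score` are toy stand-ins (assumptions) for the sentence-embedding and cross-encoder models used in the linked sbert.net example, and every function name and sample paragraph below is hypothetical.

```python
import math
from collections import Counter


def embed(text):
    # Toy stand-in for a sentence embedding: a bag-of-words count vector.
    return Counter(text.lower().split())


def cosine(a, b):
    # Cosine similarity between two sparse count vectors.
    dot = sum(count * b[token] for token, count in a.items())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def retrieve(query, paragraphs, top_k):
    # Stage 1 (retrieve): cheap similarity against every paragraph,
    # keeping only the top_k candidates.
    q = embed(query)
    ranked = sorted(paragraphs, key=lambda p: cosine(q, embed(p)), reverse=True)
    return ranked[:top_k]


def bigrams(text):
    # Set of adjacent word pairs in the text.
    tokens = text.lower().split()
    return set(zip(tokens, tokens[1:]))


def rerank_score(query, paragraph):
    # Toy stand-in for a cross-encoder: it looks at the query and the
    # paragraph together and counts the adjacent word pairs they share.
    return len(bigrams(query) & bigrams(paragraph))


def search(query, paragraphs, top_k=2):
    # Stage 2 (re-rank): rescore only the retrieved candidates.
    candidates = retrieve(query, paragraphs, top_k)
    return sorted(candidates, key=lambda p: rerank_score(query, p), reverse=True)


# Hypothetical sample data, not taken from the app's dataset.
PARAGRAPHS = [
    "social movements shape public opinion over time",
    "public opinion polls and social media",
    "logistic regression models of inequality",
]
QUERY = "social movements and public opinion"

if __name__ == "__main__":
    for paragraph in search(QUERY, PARAGRAPHS):
        print(paragraph)
```

In this toy data the first stage ranks the polling paragraph highest because it shares the most individual words with the query, and the second stage moves the social movements paragraph to the top because it shares more word pairs. That mirrors the design choice behind retrieve & re-rank: a fast scorer narrows the pool, and a slower, more careful scorer orders the short list.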