Harika22 commited on
Commit
2454fb2
·
verified ·
1 Parent(s): b8713da

Update pages/5_Pre-procesing_of_text.py

Browse files
Files changed (1) hide show
  1. pages/5_Pre-procesing_of_text.py +15 -7
pages/5_Pre-procesing_of_text.py CHANGED
@@ -61,11 +61,11 @@ st.markdown(
61
  <div class='section'>
62
  We will convert raw data into pre-processed data in 3 ways:
63
 
64
- - Cleaning ---> which is based on the problem statement
65
 
66
- - Simple pre-processing
67
 
68
- - Advance pre-processing
69
  </div>
70
  ''',
71
  unsafe_allow_html=True,
@@ -102,14 +102,22 @@ st.markdown(
102
 
103
  - Raw data - preprocessed data ---> required by the problem statement
104
  <ul>
105
- <li><b>Converting into particular case</b> So that highly we can reduce the dimensionalty.If the problem statement says that grammar should be preserved then no need of conversion</li>
106
- <li><b>Removing URL's / tags/mails/mentions</b> Converting or preserving information should be based on the problem statement</li>
107
- <li><b>Handling Emoji's</b> Emoji's data should be preserved</li>
108
  <li><b>Contractions and acronyms</b>Both the contractions and acronyms should be converted into general text</li>
109
- <li><b>Stop Words</b> Stop words make the grammar very clear</li>
110
  <li><b>Stemming and Lemmatization</b>Both are purely based on problm statement and if problem statement wants grammatical concept don't perform stemming</li>
111
  </ul>
112
  </div>
113
  ''',
114
  unsafe_allow_html=True,
115
  )
 
 
 
 
 
 
 
 
 
61
  <div class='section'>
62
  We will convert raw data into pre-processed data in 3 ways:
63
 
64
+ Cleaning - which is based on the problem statement
65
 
66
+ Simple pre-processing
67
 
68
+ Advance pre-processing
69
  </div>
70
  ''',
71
  unsafe_allow_html=True,
 
102
 
103
  - Raw data - preprocessed data ---> required by the problem statement
104
  <ul>
105
+ <li><b>Converting into particular case</b>So that highly we can reduce the dimensionalty.If the problem statement says that grammar should be preserved then no need of conversion</li>
106
+ <li><b>Removing URL's / tags/mails/mentions</b>Converting or preserving information should be based on the problem statement</li>
107
+ <li><b>Handling Emoji's</b>Emoji's data should be preserved</li>
108
  <li><b>Contractions and acronyms</b>Both the contractions and acronyms should be converted into general text</li>
109
+ <li><b>Stop Words</b>Stop words make the grammar very clear</li>
110
  <li><b>Stemming and Lemmatization</b>Both are purely based on problm statement and if problem statement wants grammatical concept don't perform stemming</li>
111
  </ul>
112
  </div>
113
  ''',
114
  unsafe_allow_html=True,
115
  )
116
+
117
+ st.markdown(
118
+ """
119
+ <div class='caption'>***Step into the world of NLP and discover the endless possibilities of language-driven innovation!***</div>
120
+ """,
121
+ unsafe_allow_html=True,
122
+ )
123
+