Spaces:

Harika22
/

Natural_Language_Processing

Sleeping

Harika22 commited on Feb 1, 2025

Commit

2454fb2

verified ·

1 Parent(s): b8713da

Update pages/5_Pre-procesing_of_text.py

Files changed (1) hide show

pages/5_Pre-procesing_of_text.py CHANGED Viewed

@@ -61,11 +61,11 @@ st.markdown(
     <div class='section'>
         We will convert raw data into pre-processed data in 3 ways:
-            - Cleaning ---> which is based on the problem statement
-            - Simple pre-processing
-            - Advance pre-processing
     </div>
     ''',
     unsafe_allow_html=True,
@@ -102,14 +102,22 @@ st.markdown(
             - Raw data - preprocessed data ---> required by the problem statement
         <ul>
-            <li><b>Converting into particular case</b> So that highly we can reduce the dimensionalty.If the problem statement says that grammar should be preserved then no need of conversion</li>
-            <li><b>Removing URL's / tags/mails/mentions</b> Converting or preserving information should be based on the problem statement</li>
-            <li><b>Handling Emoji's</b> Emoji's data should be preserved</li>
             <li><b>Contractions and acronyms</b>Both the contractions and acronyms should be converted into general text</li>
-            <li><b>Stop Words</b> Stop words make the grammar very clear</li>
             <li><b>Stemming and Lemmatization</b>Both are purely based on problm statement and if problem statement wants grammatical concept don't perform stemming</li>
         </ul>
     </div>
     ''',
     unsafe_allow_html=True,
 )

     <div class='section'>
         We will convert raw data into pre-processed data in 3 ways:
+        Cleaning - which is based on the problem statement
+        Simple pre-processing
+        Advance pre-processing
     </div>
     ''',
     unsafe_allow_html=True,
             - Raw data - preprocessed data ---> required by the problem statement
         <ul>
+            <li><b>Converting into particular case</b>So that highly we can reduce the dimensionalty.If the problem statement says that grammar should be preserved then no need of conversion</li>
+            <li><b>Removing URL's / tags/mails/mentions</b>Converting or preserving information should be based on the problem statement</li>
+            <li><b>Handling Emoji's</b>Emoji's data should be preserved</li>
             <li><b>Contractions and acronyms</b>Both the contractions and acronyms should be converted into general text</li>
+            <li><b>Stop Words</b>Stop words make the grammar very clear</li>
             <li><b>Stemming and Lemmatization</b>Both are purely based on problm statement and if problem statement wants grammatical concept don't perform stemming</li>
         </ul>
     </div>
     ''',
     unsafe_allow_html=True,
 )
+st.markdown(
+    """
+    <div class='caption'>***Step into the world of NLP and discover the endless possibilities of language-driven innovation!***</div>
+    """,
+    unsafe_allow_html=True,
+)