Update pages/5_Pre-procesing_of_text.py
Browse files
pages/5_Pre-procesing_of_text.py
CHANGED
|
@@ -104,12 +104,26 @@ st.markdown("β
**Stemming & Lemmatization** β Perform only if grammar is **n
|
|
| 104 |
|
| 105 |
st.markdown("</div>", unsafe_allow_html=True)
|
| 106 |
|
| 107 |
-
|
| 108 |
-
|
| 109 |
|
| 110 |
st.markdown(
|
| 111 |
"""
|
| 112 |
-
<div class='
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 113 |
""",
|
| 114 |
-
unsafe_allow_html=True
|
| 115 |
-
)
|
|
|
|
| 104 |
|
| 105 |
st.markdown("</div>", unsafe_allow_html=True)
|
| 106 |
|
| 107 |
+
st.markdown("<h1 class='header-title'>π Stemming & Lemmatization π¬</h1>", unsafe_allow_html=True)
|
|
|
|
| 108 |
|
| 109 |
st.markdown(
|
| 110 |
"""
|
| 111 |
+
<div class='info-box'>
|
| 112 |
+
<p>π In English, words are often made up of three components:</p>
|
| 113 |
+
<ul>
|
| 114 |
+
<li>πΉ <span class='highlight'>Prefix</span> + <span class='highlight'>Word</span> + <span class='highlight'>Suffix</span></li>
|
| 115 |
+
</ul>
|
| 116 |
+
<p> Words **without a suffix** are called <span class='highlight'>Root Words</span>.</p>
|
| 117 |
+
<p> If a **suffix is added** to a root word, the resulting word is an <span class='highlight'>Inflected Word</span>:</p>
|
| 118 |
+
<ul>
|
| 119 |
+
<li>π οΈ <span class='highlight'>Root Word</span> + <span class='highlight'>Suffix</span> = Inflected Word</li>
|
| 120 |
+
</ul>
|
| 121 |
+
<p>The process of **removing the suffix from inflected words** to get the root word is known as:</p>
|
| 122 |
+
<ul>
|
| 123 |
+
<li><span class='highlight'>Stemming</span></li>
|
| 124 |
+
<li><span class='highlight'>Lemmatization</span></li>
|
| 125 |
+
</ul>
|
| 126 |
+
</div>
|
| 127 |
""",
|
| 128 |
+
unsafe_allow_html=True
|
| 129 |
+
)
|