Update pages/6_Feature_Engineering.py
- pages/6_Feature_Engineering.py +15 -17
pages/6_Feature_Engineering.py
CHANGED
|
@@ -81,42 +81,41 @@ st.markdown("<h1 class='header-title'>🛠️ Feature Engineering</h1>", un
|
|
| 81 |
st.markdown(
|
| 82 |
"""
|
| 83 |
<div class='info-box'>
|
| 84 |
-
<p>🔹 When we take
|
| 85 |
-
<p
|
| 86 |
-
<p
|
| 87 |
</div>
|
| 88 |
""",
|
| 89 |
unsafe_allow_html=True
|
| 90 |
)
|
| 91 |
|
| 92 |
-
st.
|
| 93 |
-
|
| 94 |
st.markdown(
|
| 95 |
"""
|
| 96 |
<div class='info-box'>
|
| 97 |
-
<p>
|
| 98 |
<ul>
|
| 99 |
-
<li
|
| 100 |
-
<li
|
| 101 |
-
<li
|
| 102 |
</ul>
|
| 103 |
</div>
|
| 104 |
""",
|
| 105 |
unsafe_allow_html=True
|
| 106 |
)
|
| 107 |
|
| 108 |
-
st.
|
| 109 |
st.markdown(
|
| 110 |
"""
|
| 111 |
<div class='info-box'>
|
| 112 |
-
<p
|
| 113 |
-
<p
|
| 114 |
</div>
|
| 115 |
""",
|
| 116 |
unsafe_allow_html=True
|
| 117 |
)
|
| 118 |
|
| 119 |
-
st.
|
| 120 |
st.markdown(
|
| 121 |
"""
|
| 122 |
<div class='info-box'>
|
|
@@ -130,15 +129,14 @@ st.markdown(
|
|
| 130 |
""",
|
| 131 |
unsafe_allow_html=True
|
| 132 |
)
|
| 133 |
-
|
| 134 |
st.markdown(
|
| 135 |
"""
|
| 136 |
<div class='info-box'>
|
| 137 |
<p>Advanced Vectorization Techniques:</p>
|
| 138 |
<ul>
|
| 139 |
-
<li
|
| 140 |
-
<li
|
| 141 |
-
<li
|
| 142 |
</ul>
|
| 143 |
</div>
|
| 144 |
""",
|
|
|
|
| 81 |
st.markdown(
|
| 82 |
"""
|
| 83 |
<div class='info-box'>
|
| 84 |
+
<p>🔹 When we take existing features from collected data and derive new, useful features from them, the technique of creating those features is known as <span class='highlight'>Feature Engineering</span>.</p>
|
| 85 |
+
<p> These engineered features enhance machine learning models.</p>
|
| 86 |
+
<p> A subpart of feature engineering is Feature Extraction.</p>
|
| 87 |
</div>
|
| 88 |
""",
|
| 89 |
unsafe_allow_html=True
|
| 90 |
)
|
| 91 |
|
| 92 |
+
st.subheader(":violet[Feature Extraction]")
|
|
|
|
| 93 |
st.markdown(
|
| 94 |
"""
|
| 95 |
<div class='info-box'>
|
| 96 |
+
<p>Feature Extraction is the process of converting natural-language text into a form that a machine can understand.</p>
|
| 97 |
<ul>
|
| 98 |
+
<li>Text is converted into vectors using specific algorithms.</li>
|
| 99 |
+
<li>Preserving meaningful information is key.</li>
|
| 100 |
+
<li>Helps in better text analysis and machine learning.</li>
|
| 101 |
</ul>
|
| 102 |
</div>
|
| 103 |
""",
|
| 104 |
unsafe_allow_html=True
|
| 105 |
)
|
| 106 |
|
| 107 |
+
st.header("Vectorization")
|
| 108 |
st.markdown(
|
| 109 |
"""
|
| 110 |
<div class='info-box'>
|
| 111 |
+
<p>Vectorization is the process of converting text into numeric vectors.</p>
|
| 112 |
+
<p>This allows ML models to process text data effectively.</p>
|
| 113 |
</div>
|
| 114 |
""",
|
| 115 |
unsafe_allow_html=True
|
| 116 |
)
|
| 117 |
|
| 118 |
+
st.subheader("Vectorization techniques")
|
| 119 |
st.markdown(
|
| 120 |
"""
|
| 121 |
<div class='info-box'>
|
|
|
|
| 129 |
""",
|
| 130 |
unsafe_allow_html=True
|
| 131 |
)
|
|
|
|
| 132 |
st.markdown(
|
| 133 |
"""
|
| 134 |
<div class='info-box'>
|
| 135 |
<p>Advanced Vectorization Techniques:</p>
|
| 136 |
<ul>
|
| 137 |
+
<li>🔹 Word Embeddings</li>
|
| 138 |
+
<li>🔹 Word2Vec</li>
|
| 139 |
+
<li>🔹 FastText</li>
|
| 140 |
</ul>
|
| 141 |
</div>
|
| 142 |
""",
|
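The page text describes vectorization as converting text into vectors, but the diff elides the list of basic techniques, so the sketch below is only an illustration and not code from this commit. It assumes a simple bag-of-words count scheme as a stand-in; the names `build_vocabulary` and `vectorize` and the two-sentence corpus are hypothetical.

```python
from collections import Counter


def build_vocabulary(documents):
    """Map each distinct lowercase token to a column index, in sorted order."""
    tokens = {token for doc in documents for token in doc.lower().split()}
    return {token: index for index, token in enumerate(sorted(tokens))}


def vectorize(document, vocabulary):
    """Return a count vector for one document; unknown tokens are ignored."""
    counts = Counter(document.lower().split())
    vector = [0] * len(vocabulary)
    for token, count in counts.items():
        if token in vocabulary:
            vector[vocabulary[token]] = count
    return vector


# Hypothetical toy corpus, used only to show the shape of the output.
corpus = ["the cat sat", "the cat saw the dog"]
vocab = build_vocabulary(corpus)
vectors = [vectorize(doc, vocab) for doc in corpus]
```

Each document becomes a fixed-length list of integers with one position per vocabulary word, which is the kind of numeric input an ML model can consume. Embedding methods such as Word2Vec or FastText produce dense vectors instead and would need a trained model.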