Harika22 commited on
Commit
b3b48bc
ยท
verified ยท
1 Parent(s): f7c5f07

Update pages/6_Feature_Engineering.py

Browse files
Files changed (1) hide show
  1. pages/6_Feature_Engineering.py +63 -32
pages/6_Feature_Engineering.py CHANGED
@@ -76,40 +76,71 @@ st.markdown("""
76
  </style>
77
  """, unsafe_allow_html=True)
78
 
 
79
 
80
- st.header("Feature Engineering๐Ÿ“Œ")
81
- st.markdown('''
82
- - When you already have n- no.of features which belongs to collected data and we want to add extra feature where this is automatically engineered made from existing features and the technique of creating the feature is called **feature engineering**
83
- - There is a sub part in feature engineering known as feature extraction
84
- ''')
 
 
 
 
 
85
 
86
- st.subheader(":violet[Feature Extraxtion]")
87
- st.markdown('''
88
- - We are having text data which is natural language where the text is given to machine to understand the natural language
89
- - Text is converted into vector form with feature extraction techniques using algorithms which helps to convert text iinto vector
90
- - While converting text into vector information should be preserved
91
- ''')
92
 
93
- st.header("Vectorization๐Ÿงญ")
94
- st.markdown('''
95
- - Vectorization is a technique of converting text into vectors
96
- ''')
 
 
 
 
 
 
 
 
 
97
 
98
- st.subheader("Vectorization techniques")
99
- st.markdown("""
100
- There a different techniques to convert text into vector format.They are :
101
- <ul class="icon-bullet">
102
- <li>One-Hot Vectorization </li>
103
- <li>Bag of Words(BOW) </li>
104
- <li>Term Frequency - Inverse Document Frequency(TF-IDF) </li>
105
- </ul>
106
- """, unsafe_allow_html=True)
 
107
 
108
- st.markdown("""
109
- There are advance vectorization techniques.They are :
110
- <ul class="icon-bullet">
111
- <li>Word Embedding </li>
112
- <li>Word2Vec </li>
113
- <li>Fasttext</li>
114
- </ul>
115
- """, unsafe_allow_html=True)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
76
  </style>
77
  """, unsafe_allow_html=True)
78
 
79
+ st.markdown("<h1 class='header-title'>๐Ÿ› ๏ธ Feature Engineering ๐Ÿ“Œ</h1>", unsafe_allow_html=True)
80
 
81
+ st.markdown(
82
+ """
83
+ <div class='info-box'>
84
+ <p>๐Ÿ”น When we take **existing features** from collected data and create **new useful features**,where this is automatically engineered made from existing features and the technique of creating the features is known as <span class='highlight'>Feature Engineering</span>.</p>
85
+ <p>๐Ÿ“Š These engineered features **enhance machine learning models**.</p>
86
+ <p>๐Ÿ“ A subpart of feature engineering is **Feature Extraction**.</p>
87
+ </div>
88
+ """,
89
+ unsafe_allow_html=True
90
+ )
91
 
92
+ st.markdown("<h2 class='sub-header'>๐Ÿ“ฅ Feature Extraction</h2>", unsafe_allow_html=True)
 
 
 
 
 
93
 
94
+ st.markdown(
95
+ """
96
+ <div class='info-box'>
97
+ <p>๐Ÿ“ **Feature Extraction** is the process where text data which is natural language is given to machine to understand the natural language.</p>
98
+ <ul>
99
+ <li>๐Ÿ“œ Text is **converted into vectors** using specific algorithms.</li>
100
+ <li>๐Ÿ” **Preserving meaningful information** is key.</li>
101
+ <li>โš™๏ธ Helps in better **text analysis & machine learning**.</li>
102
+ </ul>
103
+ </div>
104
+ """,
105
+ unsafe_allow_html=True
106
+ )
107
 
108
+ st.markdown("<h1 class='header-title'>๐Ÿงญ Vectorization</h1>", unsafe_allow_html=True)
109
+ st.markdown(
110
+ """
111
+ <div class='info-box'>
112
+ <p>๐Ÿ“Œ **Vectorization** is the process of converting text into **numerical vectors**.</p>
113
+ <p>๐Ÿ’ก This allows ML models to process text data effectively.</p>
114
+ </div>
115
+ """,
116
+ unsafe_allow_html=True
117
+ )
118
 
119
+ st.markdown("<h2 class='sub-header'>๐Ÿ› ๏ธ Vectorization Techniques</h2>", unsafe_allow_html=True)
120
+ st.markdown(
121
+ """
122
+ <div class='info-box'>
123
+ <p>๐Ÿ“Œ Basic Vectorization Techniques:</p>
124
+ <ul>
125
+ <li>๐Ÿ”น One-Hot Encoding</li>
126
+ <li>๐Ÿ”น Bag of Words (BoW)</li>
127
+ <li>๐Ÿ”น Term Frequency - Inverse Document Frequency (TF-IDF)</li>
128
+ </ul>
129
+ </div>
130
+ """,
131
+ unsafe_allow_html=True
132
+ )
133
+
134
+ st.markdown(
135
+ """
136
+ <div class='info-box'>
137
+ <p>๐Ÿš€ Advanced Vectorization Techniques:</p>
138
+ <ul>
139
+ <li>๐ŸŒ Word Embeddings</li>
140
+ <li>๐Ÿ”  Word2Vec</li>
141
+ <li>โšก FastText</li>
142
+ </ul>
143
+ </div>
144
+ """,
145
+ unsafe_allow_html=True
146
+ )