LakshmiHarika commited on
Commit
8d8a98a
·
verified ·
1 Parent(s): 3128429

Update pages/Data Collection.py

Browse files
Files changed (1) hide show
  1. pages/Data Collection.py +78 -0
pages/Data Collection.py CHANGED
@@ -140,3 +140,81 @@ if data_type == "Structured Data":
140
  st.query_params["page"] = "Excel"
141
  st.rerun()
142
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
140
  st.query_params["page"] = "Excel"
141
  st.rerun()
142
 
143
+ # Unstructured Data Section
144
+ elif data_type == "Unstructured Data":
145
+ st.markdown("""
146
+ <div style="text-align: left; margin-top: 20px;">
147
+ <h3 style="color: #e25822;">What is Unstructured Data?</h3>
148
+ </div>
149
+ """, unsafe_allow_html=True)
150
+
151
+ st.markdown("""
152
+ <div style="text-align: left; margin-top: 20px;">
153
+ <h4 style="color: #5b2c6f;">Definition:</h4>
154
+ </div>
155
+ """, unsafe_allow_html=True)
156
+ st.write("""
157
+ Unstructured data refers to information that does not follow a predefined format or structure.
158
+ It is typically raw data that lacks a clear, organized schema, making it harder to store and analyze using traditional tools.
159
+
160
+ Examples include multimedia files (images, videos, audio), emails, and social media posts.
161
+ """)
162
+
163
+ st.markdown("""
164
+ <div style="text-align: left; margin-top: 20px;">
165
+ <h4 style="color: #5b2c6f;">Characteristics:</h4>
166
+ </div>
167
+ """, unsafe_allow_html=True)
168
+ st.write("""
169
+ - Does not follow a specific schema or structure.
170
+ - Cannot be stored in traditional tabular formats like rows and columns.
171
+ - Requires advanced tools like machine learning or natural language processing (NLP) for analysis.
172
+ """)
173
+
174
+ st.markdown("""
175
+ <div style="text-align: left; margin-top: 20px;">
176
+ <h4 style="color: #5b2c6f;">Example:</h4>
177
+ </div>
178
+ """, unsafe_allow_html=True)
179
+ st.write("""
180
+ Examples of unstructured data include:
181
+ - **Images**: Photos, screenshots, or scanned documents.
182
+ - **Audio**: Podcasts, voice recordings, or music files.
183
+ - **Videos**: Recorded lectures, surveillance footage, or YouTube videos.
184
+ - **Text**: Emails, social media posts, and blog articles.
185
+ """)
186
+
187
+ # Transition to Formats in Unstructured Data
188
+ st.markdown("""
189
+ <div style="text-align: left; margin-top: 20px;">
190
+ <h4 style="color: #5b2c6f;">Data Formats in Unstructured Data:</h4>
191
+ </div>
192
+ """, unsafe_allow_html=True)
193
+
194
+ st.write("""
195
+ Unstructured data can exist in various formats, often requiring specialized tools for processing. Common formats include:
196
+
197
+ - **Images**: Formats like JPEG, PNG, BMP, and TIFF.
198
+ - **Audio**: Formats like MP3, WAV, and FLAC.
199
+ - **Videos**: Formats like MP4, AVI, and MKV.
200
+ - **Text**: Formats like TXT, LOG, and DOCX.
201
+ """)
202
+
203
+ # Add a button to navigate to specific formats
204
+ if st.button("Explore Unstructured Data"):
205
+ # Display buttons for each format
206
+ st.write("Select a format to explore:")
207
+
208
+ if st.button("Images"):
209
+ st.write("Images are a type of unstructured data often used in computer vision. Common formats: JPEG, PNG.")
210
+
211
+ if st.button("Audio"):
212
+ st.write("Audio files represent sound data. Common formats: MP3, WAV.")
213
+
214
+ if st.button("Video"):
215
+ st.write("Video files combine visual and audio data. Common formats: MP4, AVI.")
216
+
217
+ if st.button("Text"):
218
+ st.write("Text data includes raw documents like emails or logs. Common formats: TXT, DOCX.")
219
+
220
+