Spaces:
Sleeping
Sleeping
Update pages/Data Collection.py
Browse files- pages/Data Collection.py +52 -44
pages/Data Collection.py
CHANGED
|
@@ -11,7 +11,7 @@ st.set_page_config(
|
|
| 11 |
st.markdown("""
|
| 12 |
<style>
|
| 13 |
body, .stApp {
|
| 14 |
-
color: #4F4F4F;
|
| 15 |
background-color: #FFFFFF;
|
| 16 |
}
|
| 17 |
h1, h2, h3, h4, h5, h6 {
|
|
@@ -74,20 +74,33 @@ if data_type == "Structured Data":
|
|
| 74 |
</div>
|
| 75 |
""", unsafe_allow_html=True)
|
| 76 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 77 |
st.write("""
|
| 78 |
**Structured data** refers to information that is organized and formatted in a predefined manner, making it easy to store, retrieve, and analyze.
|
| 79 |
It is typically stored in tabular formats like rows and columns, where each field contains a specific type of information.
|
| 80 |
""")
|
| 81 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 82 |
st.write("""
|
| 83 |
-
**Characteristics:**
|
| 84 |
- Follows a fixed schema.
|
| 85 |
- Can be easily searched using query languages like SQL.
|
| 86 |
- Suitable for quantitative analysis.
|
| 87 |
""")
|
| 88 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 89 |
st.write("""
|
| 90 |
-
**Example:**
|
| 91 |
A database of students with fields like ID, name, age, and gender:
|
| 92 |
""")
|
| 93 |
|
|
@@ -113,8 +126,15 @@ if data_type == "Structured Data":
|
|
| 113 |
- **Excel**: Widely used for tabular data, calculations, and data visualization.
|
| 114 |
""")
|
| 115 |
|
| 116 |
-
if st.button("
|
| 117 |
-
st.write("
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 118 |
|
| 119 |
# Unstructured Data Section
|
| 120 |
elif data_type == "Unstructured Data":
|
|
@@ -124,24 +144,23 @@ elif data_type == "Unstructured Data":
|
|
| 124 |
</div>
|
| 125 |
""", unsafe_allow_html=True)
|
| 126 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 127 |
st.write("""
|
| 128 |
**Unstructured data** refers to information that does not follow a predefined format or structure.
|
| 129 |
-
It is typically raw data that lacks a clear, organized schema, making it harder to store and analyze using traditional tools.
|
| 130 |
""")
|
| 131 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 132 |
st.write("""
|
| 133 |
-
**Characteristics:**
|
| 134 |
- Does not follow a specific schema or structure.
|
| 135 |
-
-
|
| 136 |
-
- Requires advanced tools like machine learning or natural language processing (NLP) for analysis.
|
| 137 |
-
""")
|
| 138 |
-
|
| 139 |
-
st.write("""
|
| 140 |
-
**Examples:**
|
| 141 |
-
- **Images**: Photos, screenshots, or scanned documents.
|
| 142 |
-
- **Audio**: Podcasts, voice recordings, or music files.
|
| 143 |
-
- **Videos**: Recorded lectures, surveillance footage, or YouTube videos.
|
| 144 |
-
- **Text**: Emails, social media posts, and blog articles.
|
| 145 |
""")
|
| 146 |
|
| 147 |
st.markdown("""
|
|
@@ -150,26 +169,26 @@ elif data_type == "Unstructured Data":
|
|
| 150 |
</div>
|
| 151 |
""", unsafe_allow_html=True)
|
| 152 |
|
| 153 |
-
if st.button("
|
| 154 |
st.write("Select a format to explore:")
|
| 155 |
|
| 156 |
# Sub-buttons for specific formats
|
| 157 |
col1, col2, col3, col4 = st.columns(4)
|
| 158 |
|
| 159 |
with col1:
|
| 160 |
-
if st.button("Images"):
|
| 161 |
st.write("Redirecting to Images page...")
|
| 162 |
|
| 163 |
with col2:
|
| 164 |
-
if st.button("Audio"):
|
| 165 |
st.write("Redirecting to Audio page...")
|
| 166 |
|
| 167 |
with col3:
|
| 168 |
-
if st.button("Video"):
|
| 169 |
st.write("Redirecting to Video page...")
|
| 170 |
|
| 171 |
with col4:
|
| 172 |
-
if st.button("Text"):
|
| 173 |
st.write("Redirecting to Text page...")
|
| 174 |
|
| 175 |
# Semi-Structured Data Section
|
|
@@ -180,24 +199,13 @@ elif data_type == "Semi-Structured Data":
|
|
| 180 |
</div>
|
| 181 |
""", unsafe_allow_html=True)
|
| 182 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 183 |
st.write("""
|
| 184 |
-
**Semi-Structured data** refers to information that
|
| 185 |
-
This type of data is more flexible than structured data but still organized enough to allow for easier analysis than unstructured data.
|
| 186 |
-
""")
|
| 187 |
-
|
| 188 |
-
st.write("""
|
| 189 |
-
**Characteristics:**
|
| 190 |
-
- Contains markers or tags (e.g., XML, JSON keys) to provide structure.
|
| 191 |
-
- More flexible than structured data, allowing for varying schemas.
|
| 192 |
-
- Easier to process than unstructured data.
|
| 193 |
-
""")
|
| 194 |
-
|
| 195 |
-
st.write("""
|
| 196 |
-
**Examples:**
|
| 197 |
-
- **CSV**: Comma-separated values in plain-text files.
|
| 198 |
-
- **JSON**: A lightweight data-interchange format used in web applications.
|
| 199 |
-
- **XML**: Extensible Markup Language for structured document encoding.
|
| 200 |
-
- **HTML**: Markup language for web pages.
|
| 201 |
""")
|
| 202 |
|
| 203 |
st.markdown("""
|
|
@@ -206,24 +214,24 @@ elif data_type == "Semi-Structured Data":
|
|
| 206 |
</div>
|
| 207 |
""", unsafe_allow_html=True)
|
| 208 |
|
| 209 |
-
if st.button("
|
| 210 |
st.write("Select a format to explore:")
|
| 211 |
|
| 212 |
# Sub-buttons for specific formats
|
| 213 |
col1, col2, col3, col4 = st.columns(4)
|
| 214 |
|
| 215 |
with col1:
|
| 216 |
-
if st.button("CSV"):
|
| 217 |
st.write("Redirecting to CSV page...")
|
| 218 |
|
| 219 |
with col2:
|
| 220 |
-
if st.button("JSON"):
|
| 221 |
st.write("Redirecting to JSON page...")
|
| 222 |
|
| 223 |
with col3:
|
| 224 |
-
if st.button("XML"):
|
| 225 |
st.write("Redirecting to XML page...")
|
| 226 |
|
| 227 |
with col4:
|
| 228 |
-
if st.button("HTML"):
|
| 229 |
st.write("Redirecting to HTML page...")
|
|
|
|
| 11 |
st.markdown("""
|
| 12 |
<style>
|
| 13 |
body, .stApp {
|
| 14 |
+
color: #4F4F4F; /* Replace with your desired font color */
|
| 15 |
background-color: #FFFFFF;
|
| 16 |
}
|
| 17 |
h1, h2, h3, h4, h5, h6 {
|
|
|
|
| 74 |
</div>
|
| 75 |
""", unsafe_allow_html=True)
|
| 76 |
|
| 77 |
+
st.markdown("""
|
| 78 |
+
<div style="text-align: left; margin-top: 20px;">
|
| 79 |
+
<h4 style="color: #5b2c6f;">Definition:</h4>
|
| 80 |
+
</div>
|
| 81 |
+
""", unsafe_allow_html=True)
|
| 82 |
st.write("""
|
| 83 |
**Structured data** refers to information that is organized and formatted in a predefined manner, making it easy to store, retrieve, and analyze.
|
| 84 |
It is typically stored in tabular formats like rows and columns, where each field contains a specific type of information.
|
| 85 |
""")
|
| 86 |
|
| 87 |
+
st.markdown("""
|
| 88 |
+
<div style="text-align: left; margin-top: 20px;">
|
| 89 |
+
<h4 style="color: #5b2c6f;">Characteristics:</h4>
|
| 90 |
+
</div>
|
| 91 |
+
""", unsafe_allow_html=True)
|
| 92 |
st.write("""
|
|
|
|
| 93 |
- Follows a fixed schema.
|
| 94 |
- Can be easily searched using query languages like SQL.
|
| 95 |
- Suitable for quantitative analysis.
|
| 96 |
""")
|
| 97 |
|
| 98 |
+
st.markdown("""
|
| 99 |
+
<div style="text-align: left; margin-top: 20px;">
|
| 100 |
+
<h4 style="color: #5b2c6f;">Examples:</h4>
|
| 101 |
+
</div>
|
| 102 |
+
""", unsafe_allow_html=True)
|
| 103 |
st.write("""
|
|
|
|
| 104 |
A database of students with fields like ID, name, age, and gender:
|
| 105 |
""")
|
| 106 |
|
|
|
|
| 126 |
- **Excel**: Widely used for tabular data, calculations, and data visualization.
|
| 127 |
""")
|
| 128 |
|
| 129 |
+
if st.button("Structured Data Formats"):
|
| 130 |
+
st.write("Select a format to explore:")
|
| 131 |
+
|
| 132 |
+
# Sub-buttons for specific formats
|
| 133 |
+
col1, = st.columns(1)
|
| 134 |
+
|
| 135 |
+
with col1:
|
| 136 |
+
if st.button("Explore Excel"):
|
| 137 |
+
st.write("Redirecting to Excel page...")
|
| 138 |
|
| 139 |
# Unstructured Data Section
|
| 140 |
elif data_type == "Unstructured Data":
|
|
|
|
| 144 |
</div>
|
| 145 |
""", unsafe_allow_html=True)
|
| 146 |
|
| 147 |
+
st.markdown("""
|
| 148 |
+
<div style="text-align: left; margin-top: 20px;">
|
| 149 |
+
<h4 style="color: #5b2c6f;">Definition:</h4>
|
| 150 |
+
</div>
|
| 151 |
+
""", unsafe_allow_html=True)
|
| 152 |
st.write("""
|
| 153 |
**Unstructured data** refers to information that does not follow a predefined format or structure.
|
|
|
|
| 154 |
""")
|
| 155 |
|
| 156 |
+
st.markdown("""
|
| 157 |
+
<div style="text-align: left; margin-top: 20px;">
|
| 158 |
+
<h4 style="color: #5b2c6f;">Characteristics:</h4>
|
| 159 |
+
</div>
|
| 160 |
+
""", unsafe_allow_html=True)
|
| 161 |
st.write("""
|
|
|
|
| 162 |
- Does not follow a specific schema or structure.
|
| 163 |
+
- Requires advanced tools for analysis.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 164 |
""")
|
| 165 |
|
| 166 |
st.markdown("""
|
|
|
|
| 169 |
</div>
|
| 170 |
""", unsafe_allow_html=True)
|
| 171 |
|
| 172 |
+
if st.button("Unstructured Data Formats"):
|
| 173 |
st.write("Select a format to explore:")
|
| 174 |
|
| 175 |
# Sub-buttons for specific formats
|
| 176 |
col1, col2, col3, col4 = st.columns(4)
|
| 177 |
|
| 178 |
with col1:
|
| 179 |
+
if st.button("Explore Images"):
|
| 180 |
st.write("Redirecting to Images page...")
|
| 181 |
|
| 182 |
with col2:
|
| 183 |
+
if st.button("Explore Audio"):
|
| 184 |
st.write("Redirecting to Audio page...")
|
| 185 |
|
| 186 |
with col3:
|
| 187 |
+
if st.button("Explore Video"):
|
| 188 |
st.write("Redirecting to Video page...")
|
| 189 |
|
| 190 |
with col4:
|
| 191 |
+
if st.button("Explore Text"):
|
| 192 |
st.write("Redirecting to Text page...")
|
| 193 |
|
| 194 |
# Semi-Structured Data Section
|
|
|
|
| 199 |
</div>
|
| 200 |
""", unsafe_allow_html=True)
|
| 201 |
|
| 202 |
+
st.markdown("""
|
| 203 |
+
<div style="text-align: left; margin-top: 20px;">
|
| 204 |
+
<h4 style="color: #5b2c6f;">Definition:</h4>
|
| 205 |
+
</div>
|
| 206 |
+
""", unsafe_allow_html=True)
|
| 207 |
st.write("""
|
| 208 |
+
**Semi-Structured data** refers to information that contains markers or tags for structure but is not stored in a strict tabular format.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 209 |
""")
|
| 210 |
|
| 211 |
st.markdown("""
|
|
|
|
| 214 |
</div>
|
| 215 |
""", unsafe_allow_html=True)
|
| 216 |
|
| 217 |
+
if st.button("Semi-Structured Data Formats"):
|
| 218 |
st.write("Select a format to explore:")
|
| 219 |
|
| 220 |
# Sub-buttons for specific formats
|
| 221 |
col1, col2, col3, col4 = st.columns(4)
|
| 222 |
|
| 223 |
with col1:
|
| 224 |
+
if st.button("Explore CSV"):
|
| 225 |
st.write("Redirecting to CSV page...")
|
| 226 |
|
| 227 |
with col2:
|
| 228 |
+
if st.button("Explore JSON"):
|
| 229 |
st.write("Redirecting to JSON page...")
|
| 230 |
|
| 231 |
with col3:
|
| 232 |
+
if st.button("Explore XML"):
|
| 233 |
st.write("Redirecting to XML page...")
|
| 234 |
|
| 235 |
with col4:
|
| 236 |
+
if st.button("Explore HTML"):
|
| 237 |
st.write("Redirecting to HTML page...")
|