Spaces:

Harika22
/

Natural_Language_Processing

Sleeping

App Files Files Community

Harika22 commited on Jan 27, 2025

Commit

ab7ea37

verified ·

1 Parent(s): aac424c

Update pages/2_Life cycle of NLP.py

Browse files

Files changed (1) hide show

pages/2_Life cycle of NLP.py +33 -48

pages/2_Life cycle of NLP.py CHANGED Viewed

@@ -1,5 +1,7 @@
 import streamlit as st
 # Custom CSS for styling, animations, and a light background
 st.markdown(
     """
@@ -51,71 +53,54 @@ st.markdown(
         .image-container img:hover {
             transform: scale(1.05); /* Subtle zoom effect */
         }
     </style>
     """,
     unsafe_allow_html=True,
 )
-# Title with animation
-st.markdown("<div class='title'>NLP Life Cycle</div>", unsafe_allow_html=True)
-# Caption with animation
 st.markdown(
-    "<div class='caption'>***Explore the Journey from Problem Statement to Deployment in NLP!***</div>",
     unsafe_allow_html=True,
 )
-# Radio buttons for navigation through the steps
-step = st.radio(
-    "Choose a step in the NLP Life Cycle:",
-    ["Problem Statement", "Data Collection", "Simple EDA", "Data Pre-processing",
-     "EDA", "Feature Engineering", "Training", "Testing", "Deployment/Monitoring"],
-)
-# Step-by-step description with animations
 if step == "Problem Statement":
-    st.markdown(
-        "<div class='section'><b>1. Problem Statement</b><br>In this initial stage, we define the problem to be solved using NLP. The problem statement sets the direction for the entire project, ensuring that all efforts align with the objectives and scope.</div>",
-        unsafe_allow_html=True,
-    )
 elif step == "Data Collection":
-    st.markdown(
-        "<div class='section'><b>2. Data Collection</b><br>Here, we gather the necessary data from various sources such as text, audio, or images. High-quality, relevant data is crucial to the success of the model and the problem-solving process.</div>",
-        unsafe_allow_html=True,
-    )
 elif step == "Simple EDA":
-    st.markdown(
-        "<div class='section'><b>3. Simple EDA</b><br>Exploratory Data Analysis (EDA) is performed to understand the dataset better. Simple statistical summaries and visualizations help to identify data patterns, anomalies, and initial insights.</div>",
-        unsafe_allow_html=True,
-    )
 elif step == "Data Pre-processing":
-    st.markdown(
-        "<div class='section'><b>4. Data Pre-processing</b><br>At this stage, the data is cleaned and transformed for model compatibility. This involves tasks like tokenization, lowercasing, removing stop words, and handling missing values.</div>",
-        unsafe_allow_html=True,
-    )
 elif step == "EDA":
-    st.markdown(
-        "<div class='section'><b>5. EDA</b><br>In-depth Exploratory Data Analysis is performed to gain further insights. Visualizations like word clouds, bar plots, and histograms are used to understand relationships and distributions in the dataset.</div>",
-        unsafe_allow_html=True,
-    )
 elif step == "Feature Engineering":
-    st.markdown(
-        "<div class='section'><b>6. Feature Engineering</b><br>Feature engineering involves creating new features or modifying existing ones to improve model performance. Techniques like word embeddings, TF-IDF, and vectorization are commonly used.</div>",
-        unsafe_allow_html=True,
-    )
 elif step == "Training":
-    st.markdown(
-        "<div class='section'><b>7. Training</b><br>In this step, the data is split into training and validation sets, and machine learning models are trained using the processed features. Model selection and hyperparameter tuning are key parts of this phase.</div>",
-        unsafe_allow_html=True,
-    )
 elif step == "Testing":
-    st.markdown(
-        "<div class='section'><b>8. Testing</b><br>Once the model is trained, it is tested on unseen data to evaluate its performance. Metrics like accuracy, precision, recall, and F1 score are used to assess the model's quality.</div>",
-        unsafe_allow_html=True,
-    )
 elif step == "Deployment/Monitoring":
-    st.markdown(
-        "<div class='section'><b>9. Deployment/Monitoring</b><br>After testing, the model is deployed into a production environment where it can make predictions. Continuous monitoring ensures the model performs well and adapts to new data over time.</div>",
-        unsafe_allow_html=True,
-    )

 import streamlit as st
+import streamlit as st
 # Custom CSS for styling, animations, and a light background
 st.markdown(
     """
         .image-container img:hover {
             transform: scale(1.05); /* Subtle zoom effect */
         }
+        .sidebar {
+            width: 200px;
+        }
     </style>
     """,
     unsafe_allow_html=True,
 )
+# Sidebar navigation with radio buttons
+st.sidebar.title("NLP Life Cycle Navigation")
+step = st.sidebar.radio("Choose a step in NLP Life Cycle",
+                       ("Problem Statement", "Data Collection", "Simple EDA", "Data Pre-processing", "EDA",
+                        "Feature Engineering", "Training", "Testing", "Deployment/Monitoring"))
+# Title for the NLP Life Cycle
+st.markdown("<div class='title'>Life Cycle of NLP</div>", unsafe_allow_html=True)
+# Caption
 st.markdown(
+    "<div class='caption'>Navigating the journey of NLP from start to deployment!...</div>",
     unsafe_allow_html=True,
 )
+# Display content based on the selected step from the sidebar
 if step == "Problem Statement":
+    st.markdown("<div class='section'><b>Problem Statement</b><br>Every NLP project begins by identifying the problem that needs solving. It could range from sentiment analysis to machine translation, based on the requirements.</div>", unsafe_allow_html=True)
 elif step == "Data Collection":
+    st.markdown("<div class='section'><b>Data Collection</b><br>The next step is to gather relevant text data from various sources such as servers, web-scrapping(text).</div>", unsafe_allow_html=True)
 elif step == "Simple EDA":
+    st.markdown("<div class='section'><b>Simple EDA</b><br>Before diving deep into modeling, it's crucial to understand the data. Simple EDA gives the quality of the collected text data.</div>", unsafe_allow_html=True)
 elif step == "Data Pre-processing":
+    st.markdown("<div class='section'><b>Data Pre-processing</b><br>Pre-processing includes cleaning the data and pre-processing using different techniques based on the problem statement.</div>", unsafe_allow_html=True)
 elif step == "EDA":
+    st.markdown("<div class='section'><b>EDA (Exploratory Data Analysis)</b><br>In this deeper phase of EDA, visualizations like word clouds, bar plots, and heatmaps are created to gain insights into the data. Identifying correlations, trends, and outliers is crucial here.</div>", unsafe_allow_html=True)
 elif step == "Feature Engineering":
+    st.markdown("<div class='section'><b>Feature Engineering</b><br>Feature engineering involves creating new features or transforming existing ones to better represent the data for machine learning models.Convert text into numerical format(**Vectorization**)</div>", unsafe_allow_html=True)
 elif step == "Training":
+    st.markdown("<div class='section'><b>Training</b><br>The model is trained using the pre-processed data.</div>", unsafe_allow_html=True)
 elif step == "Testing":
+    st.markdown("<div class='section'><b>Testing</b><br>After training, the model is evaluated on a separate test dataset.</div>", unsafe_allow_html=True)
 elif step == "Deployment/Monitoring":
+    st.markdown("<div class='section'><b>Deployment and Monitoring</b><br>Once the model is trained and tested, it is deployed into a real-world environment. Continuous monitoring is needed to ensure the model performs well over time, especially as new data comes in.</div>", unsafe_allow_html=True)