Update app.py
Browse files
app.py
CHANGED
|
@@ -41,25 +41,6 @@ The NLP lifecycle consists of several stages, each contributing to transforming
|
|
| 41 |
|
| 42 |
These stages are crucial for building effective NLP applications that provide value to users.
|
| 43 |
""")
|
| 44 |
-
|
| 45 |
-
# NLP Techniques
|
| 46 |
-
st.header('NLP Techniques')
|
| 47 |
-
st.write("""
|
| 48 |
-
Some key techniques used in NLP include:
|
| 49 |
-
|
| 50 |
-
- **Tokenization**: The process of breaking down text into smaller units, such as words or sentences.
|
| 51 |
-
- **Stop Word Removal**: The process of removing common words (e.g., "the", "a", "and") that do not contribute significant meaning to the text.
|
| 52 |
-
- **Stemming**: Reducing words to their root form (e.g., "running" β "run").
|
| 53 |
-
- **Lemmatization**: Similar to stemming but more accurate, reducing words to their dictionary form (e.g., "better" β "good").
|
| 54 |
-
- **Named Entity Recognition (NER)**: Identifying entities such as people, organizations, and locations within text.
|
| 55 |
-
- **Part-of-Speech Tagging**: Identifying the grammatical structure of words in a sentence, such as nouns, verbs, adjectives, etc.
|
| 56 |
-
- **Word Embeddings**: A technique that maps words into continuous vector space, capturing semantic relationships between words (e.g., Word2Vec, GloVe).
|
| 57 |
-
- **Text Classification**: Categorizing text into predefined labels or categories (e.g., spam detection, sentiment analysis).
|
| 58 |
-
- **Sentiment Analysis**: Determining the sentiment expressed in a text, such as whether it is positive, negative, or neutral.
|
| 59 |
-
|
| 60 |
-
These techniques are the building blocks for solving various NLP tasks and are essential for developing applications that can understand human language.
|
| 61 |
-
""")
|
| 62 |
-
|
| 63 |
# Define the available NLP lifecycle stages
|
| 64 |
lifecycle_stages = ['Data Collection', 'Text Preprocessing', 'Text Representation',
|
| 65 |
'Model Training', 'Evaluation', 'Deployment']
|
|
@@ -164,6 +145,25 @@ elif selected_page == 'Deployment':
|
|
| 164 |
- Retraining may be required periodically to account for changes in language usage or new trends in the data.
|
| 165 |
""")
|
| 166 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 167 |
# Define the available NLP tasks
|
| 168 |
tasks = ['Text Classification', 'Sentiment Analysis', 'Named Entity Recognition (NER)',
|
| 169 |
'Language Translation', 'Text Summarization', 'Part-of-Speech Tagging',
|
|
|
|
| 41 |
|
| 42 |
These stages are crucial for building effective NLP applications that provide value to users.
|
| 43 |
""")
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 44 |
# Define the available NLP lifecycle stages
|
| 45 |
lifecycle_stages = ['Data Collection', 'Text Preprocessing', 'Text Representation',
|
| 46 |
'Model Training', 'Evaluation', 'Deployment']
|
|
|
|
| 145 |
- Retraining may be required periodically to account for changes in language usage or new trends in the data.
|
| 146 |
""")
|
| 147 |
|
| 148 |
+
# NLP Techniques
|
| 149 |
+
st.header('NLP Techniques')
|
| 150 |
+
st.write("""
|
| 151 |
+
Some key techniques used in NLP include:
|
| 152 |
+
|
| 153 |
+
- **Tokenization**: The process of breaking down text into smaller units, such as words or sentences.
|
| 154 |
+
- **Stop Word Removal**: The process of removing common words (e.g., "the", "a", "and") that do not contribute significant meaning to the text.
|
| 155 |
+
- **Stemming**: Reducing words to their root form (e.g., "running" β "run").
|
| 156 |
+
- **Lemmatization**: Similar to stemming but more accurate, reducing words to their dictionary form (e.g., "better" β "good").
|
| 157 |
+
- **Named Entity Recognition (NER)**: Identifying entities such as people, organizations, and locations within text.
|
| 158 |
+
- **Part-of-Speech Tagging**: Identifying the grammatical structure of words in a sentence, such as nouns, verbs, adjectives, etc.
|
| 159 |
+
- **Word Embeddings**: A technique that maps words into continuous vector space, capturing semantic relationships between words (e.g., Word2Vec, GloVe).
|
| 160 |
+
- **Text Classification**: Categorizing text into predefined labels or categories (e.g., spam detection, sentiment analysis).
|
| 161 |
+
- **Sentiment Analysis**: Determining the sentiment expressed in a text, such as whether it is positive, negative, or neutral.
|
| 162 |
+
|
| 163 |
+
These techniques are the building blocks for solving various NLP tasks and are essential for developing applications that can understand human language.
|
| 164 |
+
""")
|
| 165 |
+
|
| 166 |
+
|
| 167 |
# Define the available NLP tasks
|
| 168 |
tasks = ['Text Classification', 'Sentiment Analysis', 'Named Entity Recognition (NER)',
|
| 169 |
'Language Translation', 'Text Summarization', 'Part-of-Speech Tagging',
|