Update pages/6_Feature_Engineering.py
Browse files
pages/6_Feature_Engineering.py
CHANGED
|
@@ -365,7 +365,7 @@ elif file_type == "Bag of Words(BOW)":
|
|
| 365 |
''')
|
| 366 |
|
| 367 |
st.markdown("""
|
| 368 |
-
### 🛠️ Steps in Bag of Words(
|
| 369 |
- Create a vocabulary (set of unique words)
|
| 370 |
- Each document is converted into vector form(d-dimension)
|
| 371 |
- In bag of words the value is count , but in binary bag of words it tells whether the word is preseent or not
|
|
@@ -374,4 +374,8 @@ elif file_type == "Bag of Words(BOW)":
|
|
| 374 |
- Calculation of distance will be way more faster than bag of words
|
| 375 |
- distance is total no.of unique words between two documents
|
| 376 |
""")
|
|
|
|
|
|
|
|
|
|
|
|
|
| 377 |
|
|
|
|
| 365 |
''')
|
| 366 |
|
| 367 |
st.markdown("""
|
| 368 |
+
### 🛠️ Steps in Binary Bag of Words(BBOW):
|
| 369 |
- Create a vocabulary (set of unique words)
|
| 370 |
- Each document is converted into vector form(d-dimension)
|
| 371 |
- In bag of words the value is count , but in binary bag of words it tells whether the word is preseent or not
|
|
|
|
| 374 |
- Calculation of distance will be way more faster than bag of words
|
| 375 |
- distance is total no.of unique words between two documents
|
| 376 |
""")
|
| 377 |
+
|
| 378 |
+
|
| 379 |
+
elif file_type == "Term Frequency - Inverse Document Frequency(TF-IDF)":
|
| 380 |
+
st.title(":red[Term Frequency - Inverse Document Frequency(TF-IDF)]")
|
| 381 |
|