Spaces:
Sleeping
Sleeping
Update app.py
Browse files
app.py
CHANGED
|
@@ -231,8 +231,6 @@ with st.expander("ℹ️ App Information & Chunking Details", expanded=False):
|
|
| 231 |
Converts the webpage’s HTML into Markdown, translating tags (`<h1>`, `<p>`, `<ul>`) into their Markdown equivalents (`#`, paragraph breaks, `*`) to preserve layout and hierarchy.
|
| 232 |
2. **Layout-Aware Parsing (`MarkdownNodeParser`):**
|
| 233 |
Uses LlamaIndex’s `MarkdownNodeParser` to read the structured Markdown and split it at logical boundaries (headers like `#`, `##`, etc.), yielding context-aware chunks that respect original sections.
|
| 234 |
-
|
| 235 |
-
_Note: Some websites may block content scraping. This is an early version, so you might encounter bugs._
|
| 236 |
"""
|
| 237 |
, icon="ℹ️")
|
| 238 |
|
|
|
|
| 231 |
Converts the webpage’s HTML into Markdown, translating tags (`<h1>`, `<p>`, `<ul>`) into their Markdown equivalents (`#`, paragraph breaks, `*`) to preserve layout and hierarchy.
|
| 232 |
2. **Layout-Aware Parsing (`MarkdownNodeParser`):**
|
| 233 |
Uses LlamaIndex’s `MarkdownNodeParser` to read the structured Markdown and split it at logical boundaries (headers like `#`, `##`, etc.), yielding context-aware chunks that respect original sections.
|
|
|
|
|
|
|
| 234 |
"""
|
| 235 |
, icon="ℹ️")
|
| 236 |
|