Em4e commited on
Commit
37f325d
·
verified ·
1 Parent(s): 7926d85

Update app.py

Browse files
Files changed (1) hide show
  1. app.py +0 -2
app.py CHANGED
@@ -231,8 +231,6 @@ with st.expander("ℹ️ App Information & Chunking Details", expanded=False):
231
  Converts the webpage’s HTML into Markdown, translating tags (`<h1>`, `<p>`, `<ul>`) into their Markdown equivalents (`#`, paragraph breaks, `*`) to preserve layout and hierarchy.
232
  2. **Layout-Aware Parsing (`MarkdownNodeParser`):**
233
  Uses LlamaIndex’s `MarkdownNodeParser` to read the structured Markdown and split it at logical boundaries (headers like `#`, `##`, etc.), yielding context-aware chunks that respect original sections.
234
-
235
- _Note: Some websites may block content scraping. This is an early version, so you might encounter bugs._
236
  """
237
  , icon="ℹ️")
238
 
 
231
  Converts the webpage’s HTML into Markdown, translating tags (`<h1>`, `<p>`, `<ul>`) into their Markdown equivalents (`#`, paragraph breaks, `*`) to preserve layout and hierarchy.
232
  2. **Layout-Aware Parsing (`MarkdownNodeParser`):**
233
  Uses LlamaIndex’s `MarkdownNodeParser` to read the structured Markdown and split it at logical boundaries (headers like `#`, `##`, etc.), yielding context-aware chunks that respect original sections.
 
 
234
  """
235
  , icon="ℹ️")
236