Spaces:

Em4e
/

chunk-based-text-editor

Sleeping

Em4e commited on Jun 9, 2025

Commit

37f325d

verified ·

1 Parent(s): 7926d85

Update app.py

Files changed (1) hide show

app.py CHANGED Viewed

@@ -231,8 +231,6 @@ with st.expander("ℹ️ App Information & Chunking Details", expanded=False):
            Converts the webpage’s HTML into Markdown, translating tags (`<h1>`, `<p>`, `<ul>`) into their Markdown equivalents (`#`, paragraph breaks, `*`) to preserve layout and hierarchy.
         2. **Layout-Aware Parsing (`MarkdownNodeParser`):**
            Uses LlamaIndex’s `MarkdownNodeParser` to read the structured Markdown and split it at logical boundaries (headers like `#`, `##`, etc.), yielding context-aware chunks that respect original sections.
-        _Note: Some websites may block content scraping. This is an early version, so you might encounter bugs._
         """
     , icon="ℹ️")

            Converts the webpage’s HTML into Markdown, translating tags (`<h1>`, `<p>`, `<ul>`) into their Markdown equivalents (`#`, paragraph breaks, `*`) to preserve layout and hierarchy.
         2. **Layout-Aware Parsing (`MarkdownNodeParser`):**
            Uses LlamaIndex’s `MarkdownNodeParser` to read the structured Markdown and split it at logical boundaries (headers like `#`, `##`, etc.), yielding context-aware chunks that respect original sections.
         """
     , icon="ℹ️")