Spaces:
Sleeping
Sleeping
| title: Screenshot Scraper | |
| emoji: π | |
| colorFrom: gray | |
| colorTo: purple | |
| sdk: gradio | |
| sdk_version: 6.4.0 | |
| app_file: app.py | |
| pinned: false | |
| Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference | |
| --- | |
| title: Web Scraper for n8n | |
| emoji: π | |
| colorFrom: blue | |
| colorTo: green | |
| sdk: gradio | |
| sdk_version: 4.19.0 | |
| app_file: app.py | |
| pinned: false | |
| license: mit | |
| --- | |
| # Web Scraper for n8n | |
| A simple web scraper API that extracts text from webpages, designed to work with n8n via HTTP requests. | |
| ## Features | |
| - β Extract text content from any webpage | |
| - β BeautifulSoup for smart HTML parsing | |
| - β Simple regex fallback | |
| - β JSON API for n8n integration | |
| - β Gradio web interface | |
| ## Usage with n8n | |
| 1. **HTTP Request Node Configuration:** | |
| - **Method:** POST | |
| - **URL:** `https://your-username-space-name.hf.space/scrape` | |
| - **Body:** | |
| ```json | |
| { | |
| "url": "https://example.com" | |
| } | |
| ``` | |
| 2. **Example Response:** | |
| ```json | |
| { | |
| "success": true, | |
| "url": "https://example.com", | |
| "execution_time": 0.45, | |
| "method": "beautifulsoup", | |
| "extracted_text": "...", | |
| "text_length": 1234 | |
| } |