yukee1992 commited on
Commit
cfe45d2
·
verified ·
1 Parent(s): e68d970

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +35 -0
README.md CHANGED
@@ -10,3 +10,38 @@ pinned: false
10
  ---
11
 
12
  Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
10
  ---
11
 
12
  Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
13
+
14
+ # Web Scraper for n8n
15
+
16
+ A lightweight web scraper API designed to work with n8n via HTTP Request nodes, deployed on Hugging Face Spaces.
17
+
18
+ ## Features
19
+ - ✅ Direct HTML text extraction (fastest method)
20
+ - ✅ BeautifulSoup parsing when available
21
+ - ✅ Simple regex-based fallback parsing
22
+ - ✅ FastAPI backend with CORS enabled
23
+ - ✅ Gradio web interface for testing
24
+ - ✅ Optimized for Hugging Face Spaces
25
+
26
+ ## Usage with n8n
27
+
28
+ 1. Deploy this space on Hugging Face
29
+ 2. In n8n, add an **HTTP Request** node
30
+ 3. Configure:
31
+ - **Method**: POST
32
+ - **URL**: `https://your-username-space-name.hf.space/scrape`
33
+ - **Headers**: `Content-Type: application/json`
34
+ - **Body**:
35
+ ```json
36
+ {
37
+ "url": "{{ $json.url }}"
38
+ }
39
+ ```
40
+
41
+ ## API Endpoints
42
+
43
+ - `GET /` - API documentation
44
+ - `GET /health` - Health check
45
+ - `POST /scrape` - Main scraping endpoint
46
+
47
+ ## Example n8n Workflow