Karim shoair commited on
Commit ·
706c444
1
Parent(s): 3456843
docs: updating `extract commans` page
Browse files
docs/cli/extract-commands.md
CHANGED
|
@@ -4,12 +4,12 @@
|
|
| 4 |
|
| 5 |
The `scrapling extract` command lets you download and extract content from websites directly from your terminal without writing any code. Ideal for beginners, researchers, and anyone requiring rapid web data extraction.
|
| 6 |
|
| 7 |
-
|
| 8 |
-
|
| 9 |
-
|
| 10 |
-
|
| 11 |
-
|
| 12 |
-
|
| 13 |
|
| 14 |
|
| 15 |
## What is the Extract Command group?
|
|
@@ -320,7 +320,7 @@ We will go through each command in detail below.
|
|
| 320 |
--solve-cloudflare / --no-solve-cloudflare Solve Cloudflare challenges (default: False)
|
| 321 |
--allow-webgl / --block-webgl Allow WebGL (default: True)
|
| 322 |
--network-idle / --no-network-idle Wait for network idle (default: False)
|
| 323 |
-
---real-chrome/--no-real-
|
| 324 |
--hide-canvas/--show-canvas Add noise to canvas operations (default: False)
|
| 325 |
--timeout INTEGER Timeout in milliseconds (default: 30000)
|
| 326 |
--wait INTEGER Additional wait time in milliseconds after page load (default: 0)
|
|
|
|
| 4 |
|
| 5 |
The `scrapling extract` command lets you download and extract content from websites directly from your terminal without writing any code. Ideal for beginners, researchers, and anyone requiring rapid web data extraction.
|
| 6 |
|
| 7 |
+
!!! success "Prerequisites"
|
| 8 |
+
|
| 9 |
+
1. You've completed or read the [Fetchers basics](../fetching/choosing.md) page to understand what the [Response object](../fetching/choosing.md#response-object) is and which fetcher to use.
|
| 10 |
+
2. You've completed or read the [Querying elements](../parsing/selection.md) page to understand how to find/extract elements from the [Selector](../parsing/main_classes.md#selector)/[Response](../fetching/choosing.md#response-object) object.
|
| 11 |
+
3. You've completed or read the [Main classes](../parsing/main_classes.md) page to know what properties/methods the [Response](../fetching/choosing.md#response-object) class is inheriting from the [Selector](../parsing/main_classes.md#selector) class.
|
| 12 |
+
4. You've completed or read at least one page from the fetchers section to use here for requests: [HTTP requests](../fetching/static.md), [Dynamic websites](../fetching/dynamic.md), or [Dynamic websites with hard protections](../fetching/stealthy.md).
|
| 13 |
|
| 14 |
|
| 15 |
## What is the Extract Command group?
|
|
|
|
| 320 |
--solve-cloudflare / --no-solve-cloudflare Solve Cloudflare challenges (default: False)
|
| 321 |
--allow-webgl / --block-webgl Allow WebGL (default: True)
|
| 322 |
--network-idle / --no-network-idle Wait for network idle (default: False)
|
| 323 |
+
---real-chrome/--no-real-chrome If you have a Chrome browser installed on your device, enable this, and the Fetcher will launch an instance of your browser and use it. (default: False)
|
| 324 |
--hide-canvas/--show-canvas Add noise to canvas operations (default: False)
|
| 325 |
--timeout INTEGER Timeout in milliseconds (default: 30000)
|
| 326 |
--wait INTEGER Additional wait time in milliseconds after page load (default: 0)
|