Command Line Interface
Since v0.3, Scrapling includes a powerful command-line interface that provides three main capabilities:
- Interactive Shell: An IPython-based interactive shell for web scraping that provides many shortcuts and useful tools
- Extract Commands: Scrape websites from the terminal without any programming
- Utility Commands: Installation and management tools
# Launch interactive shell
scrapling shell
# Convert the content of a page to markdown and save it to a file
scrapling extract get "https://example.com" content.md
# Get help for any command
scrapling --help
scrapling extract --help
Requirements
This section requires the extra shell dependency group, which you can install as follows:
pip install "scrapling[shell]"
It also requires the fetchers' dependencies, which you can install with the following command:
scrapling install
This downloads all browsers, along with their system dependencies and fingerprint manipulation dependencies.
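After installing, it can be useful to confirm that the scrapling command is reachable from your shell before trying the examples above. The following is a minimal sketch, not part of Scrapling itself: the have_cmd helper is a hypothetical name introduced here for illustration, and the only Scrapling commands it relies on are the ones shown in this section.

```shell
# Hypothetical helper (not part of Scrapling): succeeds if the named
# command can be found on PATH.
have_cmd() {
    command -v "$1" >/dev/null 2>&1
}

if have_cmd scrapling; then
    # The CLI is installed; print its top-level help.
    scrapling --help
else
    # The CLI is missing; point back to the install command from this section.
    echo 'scrapling not found; try: pip install "scrapling[shell]"' >&2
fi
```

If the help text prints, the shell extra is installed and you can move on to scrapling install for the fetchers' dependencies.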