Scrapling / scrapling

Commit History

style: remove unused import
10ff0d5

Karim shoair committed on

fix: update code and docstrings to remove the old google referer logic
a28879b

Karim shoair committed on

build: pump up deps and browsers versions
7bd4753

Karim shoair committed on

Merge branch 'dev' into fix/google-referrer-spoof
6bb2723

Karim shoair committed on

fix: return bare Google URL in referer instead of search query
8b89653

bortles committed on

- Add pre-compiled XPath text selector
133ff8f

mph committed on

fix: Selector.get_all_text() doesn't get all text #167
c9a1787

mph committed on

fix(type hints): use correct import for Python < 3.12 (Fixes #163)
1c774eb

Karim shoair committed on

build: pump up the version
a29b545

Karim shoair, Christian Rost committed on

fix(browsers): make flag concatenation type-safe
814b9a6

Karim shoair, Christian Rost committed on

fix(stealth): improve stealth mode by removing unnecessary scripts
cf06b6e

Karim shoair committed on

fix(mcp): make mcp use fewer tokens by stripping useless tags
130d1d8

Karim shoair committed on

fix(solver): Solve CF faster and handle websites that show captcha twice
446b9d5

Karim shoair committed on

build: pump version up and the deps
138d6ee

Karim shoair committed on

fix: complete MCP schema validation for remaining bare array types
5f93804

Karim shoair committed on

Merge branch 'dev' into fix/mcp-schema-auth-cookies
c3afea7

Karim shoair committed on

fix: make MCP get schemas validator-safe
0d7a361

Robin Ede committed on

fix: improve regex precision for Cloudflare challenge detection
979ef6f

RinZ27 committed on

fix(checkpoint): correct error message
3ddd178

Karim shoair committed on

fix(spiders): corrections and removal of dead code
c923d18

Karim shoair committed on

style: docstrings corrections for accuracy
5618e2f

Karim shoair committed on

fix: make browser fetchers type hints always present
7d2ea72

Karim shoair committed on

fix: use more accurate naming for proxy rotation strategy
61a910a

Karim shoair committed on

fix(parser)!: Optimize parser for repeated operations
a1afba2

Karim shoair committed on

style(spiders): Lower the default number of concurrent requests
288b1a4

Karim shoair committed on

fix: add typed overloads to Selectors.get() for proper default type inference
b942a24

Karim shoair committed on

build: pump up the version status
c2167a0

Karim shoair committed on

style: expose `ProxyRotator` class to the public API
04eb5f9

Karim shoair committed on

style: Use shorter and more accurate naming for constants
c2dbf4c

Karim shoair committed on

build: pump up version and deps
7c5afe3

Karim shoair committed on

style: Fix all mypy errors and add type hints to untyped function bodies
31c2447

Karim shoair committed on

feat(browsers): Add option to block requests to specific domains
47dd985

Karim shoair committed on

style(parser): Improve the type hint for `find_by_text` and `find_by_regex`
f67ebd1

Karim shoair committed on

fix(response): Force the body of the response to always be bytes
1b8cc6b

Karim shoair committed on

feat(parser)!: Make all selection return selector objects by default
a5f9b38

Karim shoair committed on

feat(proxy control): Force a proxy at request level at any given point
32daccc

Karim shoair committed on

feat(spiders/fetchers): Adding proxy rotation logic and change retry logic
ed96cdc

Karim shoair committed on

fix(browsers): Improve stealth and speed by adjusting flags
88b6ab5

Karim shoair committed on

fix(browsers): disable autoplay for all browsers
1d04135

Karim shoair committed on

feat(parser): replacing `tldextract` with `tld` library
e5733cb

Karim shoair committed on

fix(requests): return the cached fingerprint on second call
e30685e

Karim shoair committed on

feat(spiders/requests): Use a reliable method for fingerprint generation
e227110

Karim shoair committed on

style: update post requests type hints to be more accurate
c3c5ac6

Karim shoair committed on

build: Add w3lib to deps
da0d22b

Karim shoair committed on

feat(spiders): Make graceful shutdown always enabled
5ec9f75

Karim shoair committed on

fix(spiders): handle errors with manual pause during stream mode
47633d8

Karim shoair committed on

fix(spiders): fix a bug with session stats counter
98a7a0d

Karim shoair committed on

fix(spiders): Fix for when the pausing system has large queue
59117ec

Karim shoair committed on

feat(spiders): Add pause/resume system for crawls
1721cbb

Karim shoair committed on

fix(browsers): solving an issue with leaving playwright loop open when cdp connection fails
8ff23b3

Karim shoair committed on