Scrapling / scrapling /spiders

Commit History

fix(checkpoint): correct error message
3ddd178

Karim shoair commited on

fix(spiders): corrections and removal of dead code
c923d18

Karim shoair commited on

style(spiders): Lower the default number of concurrent requests
288b1a4

Karim shoair commited on

style: Fix all mypy errors and add type hints to untyped function bodies
31c2447

Karim shoair commited on

feat(proxy control): Force a proxy at request level at any given point
32daccc

Karim shoair commited on

fix(requests): return the cached fingerprint on second call
e30685e

Karim shoair commited on

feat(spiders/requests): Use a reliable method for fingerprint generation
e227110

Karim shoair commited on

feat(spiders): Make graceful shutdown always enabled
5ec9f75

Karim shoair commited on

fix(spiders): handle errors with manual pause during stream mode
47633d8

Karim shoair commited on

fix(spiders): fix a bug with session stats counter
98a7a0d

Karim shoair commited on

fix(spiders): Fix for when the pausing system has large queue
59117ec

Karim shoair commited on

feat(spiders): Add pause/resume system for crawls
1721cbb

Karim shoair commited on

style(spiders): make parse function abstract
c1fb967

Karim shoair commited on

feat(spiders): Change items hook to make it for processing items + add a stat for this
01127e9

Karim shoair commited on

fix(spiders engine): Improve logging
168160d

Karim shoair commited on

feat(spiders): Add option to logging format and date format
77ddca9

Karim shoair commited on

feat: add hook to run after collecting an item
48cc7a2

Karim shoair commited on

feat(spiders): Add a stream mode to collect items in realtime
7145885

Karim shoair commited on

feat(spiders): The option to start sessions lazily
aa7527f

Karim shoair commited on

style: remove unwanted import from spiders main init file
d1c2b16

Karim shoair commited on

feat(spiders): add simple export system for the results
c5ebc9d

Karim shoair commited on

fix: make `retry_blocked_request` take the response as well
e251ff4

Karim shoair commited on

feat(spiders): Add logic to detect blocked requests and retry them
337bce0

Karim shoair commited on

style(spiders): make the usage of uvloop optional
be76f99

Karim shoair commited on

feat: spiders use uvloop/winloop when available
3068bf6

Karim shoair commited on

fix(browsers): Close pages that give error
8209e61

Karim shoair commited on

feat(spiders system): a prototype of the new spiders system
f8f7469

Karim shoair commited on