Karim shoair committed on
Commit e0b5a75 · 1 Parent(s): aa29356

docs: update benchmarks with the current results

docs/README.md CHANGED
@@ -244,14 +244,15 @@ Scrapling isn't just powerful—it's also blazing fast, and the updates since ve
244
 
245
  | # | Library | Time (ms) | vs Scrapling |
246
  |---|:-----------------:|:---------:|:------------:|
247
- | 1 | Scrapling | 1.92 | 1.0x |
248
- | 2 | Parsel/Scrapy | 1.99 | 1.036x |
249
- | 3 | Raw Lxml | 2.33 | 1.214x |
250
- | 4 | PyQuery | 20.61 | ~11x |
251
- | 5 | Selectolax | 80.65 | ~42x |
252
- | 6 | BS4 with Lxml | 1283.21 | ~698x |
253
- | 7 | MechanicalSoup | 1304.57 | ~679x |
254
- | 8 | BS4 with html5lib | 3331.96 | ~1735x |
 
255
 
256
  ### Element Similarity & Text Search Performance
257
 
@@ -259,8 +260,8 @@ Scrapling's adaptive element finding capabilities significantly outperform alter
259
 
260
  | Library | Time (ms) | vs Scrapling |
261
  |-------------|:---------:|:------------:|
262
- | Scrapling | 1.87 | 1.0x |
263
- | AutoScraper | 10.24 | 5.476x |
264
 
265
 
266
  > All benchmarks represent averages of 100+ runs. See [benchmarks.py](https://github.com/D4Vinci/Scrapling/blob/main/benchmarks.py) for methodology.
 
244
 
245
  | # | Library | Time (ms) | vs Scrapling |
246
  |---|:-----------------:|:---------:|:------------:|
247
+ | 1 | Scrapling | 1.99 | 1.0x |
248
+ | 2 | Parsel/Scrapy | 2.01 | 1.01x |
249
+ | 3 | Raw Lxml | 2.5 | 1.256x |
250
+ | 4 | PyQuery | 22.93 | ~11.5x |
251
+ | 5 | Selectolax | 80.57 | ~40.5x |
252
+ | 6 | BS4 with Lxml | 1541.37 | ~774.6x |
253
+ | 7 | MechanicalSoup | 1547.35 | ~777.6x |
254
+ | 8 | BS4 with html5lib | 3410.58 | ~1713.9x |
255
+
256
 
257
  ### Element Similarity & Text Search Performance
258
 
 
260
 
261
  | Library | Time (ms) | vs Scrapling |
262
  |-------------|:---------:|:------------:|
263
+ | Scrapling | 2.46 | 1.0x |
264
+ | AutoScraper | 13.3 | 5.407x |
265
 
266
 
267
  > All benchmarks represent averages of 100+ runs. See [benchmarks.py](https://github.com/D4Vinci/Scrapling/blob/main/benchmarks.py) for methodology.
docs/README_AR.md CHANGED
@@ -233,24 +233,25 @@ Scrapling ليس قوياً فقط - إنه أيضاً سريع بشكل مذه
233
  ### اختبار سرعة استخراج النص (5000 عنصر متداخل)
234
 
235
  | # | المكتبة | الوقت (ms) | vs Scrapling |
236
- |---|:-----------------:|:---------:|:------------:|
237
- | 1 | Scrapling | 1.92 | 1.0x |
238
- | 2 | Parsel/Scrapy | 1.99 | 1.036x |
239
- | 3 | Raw Lxml | 2.33 | 1.214x |
240
- | 4 | PyQuery | 20.61 | ~11x |
241
- | 5 | Selectolax | 80.65 | ~42x |
242
- | 6 | BS4 with Lxml | 1283.21 | ~698x |
243
- | 7 | MechanicalSoup | 1304.57 | ~679x |
244
- | 8 | BS4 with html5lib | 3331.96 | ~1735x |
 
245
 
246
  ### أداء تشابه العناصر والبحث النصي
247
 
248
  قدرات العثور على العناصر التكيفية لـ Scrapling تتفوق بشكل كبير على البدائل:
249
 
250
- | المكتبة | الوقت (ms) | vs Scrapling |
251
- |-------------|:---------:|:------------:|
252
- | Scrapling | 1.87 | 1.0x |
253
- | AutoScraper | 10.24 | 5.476x |
254
 
255
 
256
  > تمثل جميع المعايير متوسطات أكثر من 100 تشغيل. انظر [benchmarks.py](https://github.com/D4Vinci/Scrapling/blob/main/benchmarks.py) للمنهجية.
 
233
  ### اختبار سرعة استخراج النص (5000 عنصر متداخل)
234
 
235
  | # | المكتبة | الوقت (ms) | vs Scrapling |
236
+ |---|:-----------------:|:----------:|:------------:|
237
+ | 1 | Scrapling | 1.99 | 1.0x |
238
+ | 2 | Parsel/Scrapy | 2.01 | 1.01x |
239
+ | 3 | Raw Lxml | 2.5 | 1.256x |
240
+ | 4 | PyQuery | 22.93 | ~11.5x |
241
+ | 5 | Selectolax | 80.57 | ~40.5x |
242
+ | 6 | BS4 with Lxml | 1541.37 | ~774.6x |
243
+ | 7 | MechanicalSoup | 1547.35 | ~777.6x |
244
+ | 8 | BS4 with html5lib | 3410.58 | ~1713.9x |
245
+
246
 
247
  ### أداء تشابه العناصر والبحث النصي
248
 
249
  قدرات العثور على العناصر التكيفية لـ Scrapling تتفوق بشكل كبير على البدائل:
250
 
251
+ | المكتبة | الوقت (ms) | vs Scrapling |
252
+ |-------------|:----------:|:------------:|
253
+ | Scrapling | 2.46 | 1.0x |
254
+ | AutoScraper | 13.3 | 5.407x |
255
 
256
 
257
  > تمثل جميع المعايير متوسطات أكثر من 100 تشغيل. انظر [benchmarks.py](https://github.com/D4Vinci/Scrapling/blob/main/benchmarks.py) للمنهجية.
docs/README_CN.md CHANGED
@@ -232,25 +232,26 @@ Scrapling不仅功能强大——它还速度极快,自0.3版本以来的更
232
 
233
  ### 文本提取速度测试(5000个嵌套元素)
234
 
235
- | # | | 时间(ms) | vs Scrapling |
236
- |---|:--------------:|:--------:|:------------:|
237
- | 1 | Scrapling | 1.92 | 1.0x |
238
- | 2 | Parsel/Scrapy | 1.99 | 1.036x |
239
- | 3 | Raw Lxml | 2.33 | 1.214x |
240
- | 4 | PyQuery | 20.61 | ~11x |
241
- | 5 | Selectolax | 80.65 | ~42x |
242
- | 6 | BS4 with Lxml | 1283.21 | ~698x |
243
- | 7 | MechanicalSoup | 1304.57 | ~679x |
244
- | 8 |BS4 with html5lib| 3331.96 | ~1735x |
 
245
 
246
  ### 元素相似性和文本搜索性能
247
 
248
  Scrapling的自适应元素查找功能明显优于替代方案:
249
 
250
- | | 时间(ms) | vs Scrapling |
251
- |-------------|:--------:|:------------:|
252
- | Scrapling | 1.87 | 1.0x |
253
- | AutoScraper | 10.24 | 5.476x |
254
 
255
 
256
  > 所有基准测试代表100+次运行的平均值。请参阅[benchmarks.py](https://github.com/D4Vinci/Scrapling/blob/main/benchmarks.py)了解方法。
 
232
 
233
  ### 文本提取速度测试(5000个嵌套元素)
234
 
235
+ | # | | 时间(ms) | vs Scrapling |
236
+ |---|:-----------------:|:-------:|:------------:|
237
+ | 1 | Scrapling | 1.99 | 1.0x |
238
+ | 2 | Parsel/Scrapy | 2.01 | 1.01x |
239
+ | 3 | Raw Lxml | 2.5 | 1.256x |
240
+ | 4 | PyQuery | 22.93 | ~11.5x |
241
+ | 5 | Selectolax | 80.57 | ~40.5x |
242
+ | 6 | BS4 with Lxml | 1541.37 | ~774.6x |
243
+ | 7 | MechanicalSoup | 1547.35 | ~777.6x |
244
+ | 8 | BS4 with html5lib | 3410.58 | ~1713.9x |
245
+
246
 
247
  ### 元素相似性和文本搜索性能
248
 
249
  Scrapling的自适应元素查找功能明显优于替代方案:
250
 
251
+ | | 时间(ms) | vs Scrapling |
252
+ |-------------|:------:|:------------:|
253
+ | Scrapling | 2.46 | 1.0x |
254
+ | AutoScraper | 13.3 | 5.407x |
255
 
256
 
257
  > 所有基准测试代表100+次运行的平均值。请参阅[benchmarks.py](https://github.com/D4Vinci/Scrapling/blob/main/benchmarks.py)了解方法。
docs/README_DE.md CHANGED
@@ -232,25 +232,26 @@ Scrapling ist nicht nur leistungsstark – es ist auch blitzschnell, und die Upd
232
 
233
  ### Textextraktions-Geschwindigkeitstest (5000 verschachtelte Elemente)
234
 
235
- | # | Bibliothek | Zeit (ms) | vs Scrapling |
236
- |---|:--------------------:|:---------:|:------------:|
237
- | 1 | Scrapling | 1.92 | 1.0x |
238
- | 2 | Parsel/Scrapy | 1.99 | 1.036x |
239
- | 3 | Raw Lxml | 2.33 | 1.214x |
240
- | 4 | PyQuery | 20.61 | ~11x |
241
- | 5 | Selectolax | 80.65 | ~42x |
242
- | 6 | BS4 mit Lxml | 1283.21 | ~698x |
243
- | 7 | MechanicalSoup | 1304.57 | ~679x |
244
- | 8 | BS4 mit html5lib | 3331.96 | ~1735x |
 
245
 
246
  ### Element-Ähnlichkeit & Textsuche-Leistung
247
 
248
  Scraplings adaptive Element-Finding-Fähigkeiten übertreffen Alternativen deutlich:
249
 
250
- | Bibliothek | Zeit (ms) | vs Scrapling |
251
  |-------------|:---------:|:------------:|
252
- | Scrapling | 1.87 | 1.0x |
253
- | AutoScraper | 10.24 | 5.476x |
254
 
255
 
256
  > Alle Benchmarks stellen Durchschnittswerte von über 100 Durchläufen dar. Siehe [benchmarks.py](https://github.com/D4Vinci/Scrapling/blob/main/benchmarks.py) für die Methodik.
 
232
 
233
  ### Textextraktions-Geschwindigkeitstest (5000 verschachtelte Elemente)
234
 
235
+ | # | Bibliothek | Zeit (ms) | vs Scrapling |
236
+ |---|:-----------------:|:---------:|:------------:|
237
+ | 1 | Scrapling | 1.99 | 1.0x |
238
+ | 2 | Parsel/Scrapy | 2.01 | 1.01x |
239
+ | 3 | Raw Lxml | 2.5 | 1.256x |
240
+ | 4 | PyQuery | 22.93 | ~11.5x |
241
+ | 5 | Selectolax | 80.57 | ~40.5x |
242
+ | 6 | BS4 with Lxml | 1541.37 | ~774.6x |
243
+ | 7 | MechanicalSoup | 1547.35 | ~777.6x |
244
+ | 8 | BS4 with html5lib | 3410.58 | ~1713.9x |
245
+
246
 
247
  ### Element-Ähnlichkeit & Textsuche-Leistung
248
 
249
  Scraplings adaptive Element-Finding-Fähigkeiten übertreffen Alternativen deutlich:
250
 
251
+ | Bibliothek | Zeit (ms) | vs Scrapling |
252
  |-------------|:---------:|:------------:|
253
+ | Scrapling | 2.46 | 1.0x |
254
+ | AutoScraper | 13.3 | 5.407x |
255
 
256
 
257
  > Alle Benchmarks stellen Durchschnittswerte von über 100 Durchläufen dar. Siehe [benchmarks.py](https://github.com/D4Vinci/Scrapling/blob/main/benchmarks.py) für die Methodik.
docs/README_ES.md CHANGED
@@ -232,25 +232,26 @@ Scrapling no solo es poderoso, también es increíblemente rápido, y las actual
232
 
233
  ### Prueba de Velocidad de Extracción de Texto (5000 elementos anidados)
234
 
235
- | # | Biblioteca | Tiempo (ms) | vs Scrapling |
236
- |---|:--------------------:|:-----------:|:------------:|
237
- | 1 | Scrapling | 1.92 | 1.0x |
238
- | 2 | Parsel/Scrapy | 1.99 | 1.036x |
239
- | 3 | Raw Lxml | 2.33 | 1.214x |
240
- | 4 | PyQuery | 20.61 | ~11x |
241
- | 5 | Selectolax | 80.65 | ~42x |
242
- | 6 | BS4 con Lxml | 1283.21 | ~698x |
243
- | 7 | MechanicalSoup | 1304.57 | ~679x |
244
- | 8 | BS4 con html5lib | 3331.96 | ~1735x |
 
245
 
246
  ### Rendimiento de Similitud de Elementos y Búsqueda de Texto
247
 
248
  Las capacidades de búsqueda adaptativa de elementos de Scrapling superan significativamente a las alternativas:
249
 
250
- | Biblioteca | Tiempo (ms) | vs Scrapling |
251
- |--------------|:-----------:|:------------:|
252
- | Scrapling | 1.87 | 1.0x |
253
- | AutoScraper | 10.24 | 5.476x |
254
 
255
 
256
  > Todos los benchmarks representan promedios de más de 100 ejecuciones. Ver [benchmarks.py](https://github.com/D4Vinci/Scrapling/blob/main/benchmarks.py) para la metodología.
 
232
 
233
  ### Prueba de Velocidad de Extracción de Texto (5000 elementos anidados)
234
 
235
+ | # | Biblioteca | Tiempo (ms) | vs Scrapling |
236
+ |---|:-----------------:|:-----------:|:------------:|
237
+ | 1 | Scrapling | 1.99 | 1.0x |
238
+ | 2 | Parsel/Scrapy | 2.01 | 1.01x |
239
+ | 3 | Raw Lxml | 2.5 | 1.256x |
240
+ | 4 | PyQuery | 22.93 | ~11.5x |
241
+ | 5 | Selectolax | 80.57 | ~40.5x |
242
+ | 6 | BS4 with Lxml | 1541.37 | ~774.6x |
243
+ | 7 | MechanicalSoup | 1547.35 | ~777.6x |
244
+ | 8 | BS4 with html5lib | 3410.58 | ~1713.9x |
245
+
246
 
247
  ### Rendimiento de Similitud de Elementos y Búsqueda de Texto
248
 
249
  Las capacidades de búsqueda adaptativa de elementos de Scrapling superan significativamente a las alternativas:
250
 
251
+ | Biblioteca | Tiempo (ms) | vs Scrapling |
252
+ |-------------|:-----------:|:------------:|
253
+ | Scrapling | 2.46 | 1.0x |
254
+ | AutoScraper | 13.3 | 5.407x |
255
 
256
 
257
  > Todos los benchmarks representan promedios de más de 100 ejecuciones. Ver [benchmarks.py](https://github.com/D4Vinci/Scrapling/blob/main/benchmarks.py) para la metodología.
docs/README_JP.md CHANGED
@@ -232,25 +232,26 @@ Scraplingは強力であるだけでなく、驚くほど高速で、バージ
232
 
233
  ### テキスト抽出速度テスト(5000個のネストされた要素)
234
 
235
- | # | ライブラリ | 時間(ms) | vs Scrapling |
236
- |---|:-------------------:|:--------:|:------------:|
237
- | 1 | Scrapling | 1.92 | 1.0x |
238
- | 2 | Parsel/Scrapy | 1.99 | 1.036x |
239
- | 3 | Raw Lxml | 2.33 | 1.214x |
240
- | 4 | PyQuery | 20.61 | ~11x |
241
- | 5 | Selectolax | 80.65 | ~42x |
242
- | 6 | BS4 with Lxml | 1283.21 | ~698x |
243
- | 7 | MechanicalSoup | 1304.57 | ~679x |
244
- | 8 | BS4 with html5lib | 3331.96 | ~1735x |
 
245
 
246
  ### 要素類似性とテキスト検索のパフォーマンス
247
 
248
  Scraplingの適応型要素検索機能は代替手段を大幅に上回ります:
249
 
250
- | ライブラリ | 時間(ms) | vs Scrapling |
251
- |-------------|:--------:|:------------:|
252
- | Scrapling | 1.87 | 1.0x |
253
- | AutoScraper | 10.24 | 5.476x |
254
 
255
 
256
  > すべてのベンチマークは100回以上の実行の平均を表します。方法論については[benchmarks.py](https://github.com/D4Vinci/Scrapling/blob/main/benchmarks.py)を参照してください。
 
232
 
233
  ### テキスト抽出速度テスト(5000個のネストされた要素)
234
 
235
+ | # | ライブラリ | 時間(ms) | vs Scrapling |
236
+ |---|:-----------------:|:-------:|:------------:|
237
+ | 1 | Scrapling | 1.99 | 1.0x |
238
+ | 2 | Parsel/Scrapy | 2.01 | 1.01x |
239
+ | 3 | Raw Lxml | 2.5 | 1.256x |
240
+ | 4 | PyQuery | 22.93 | ~11.5x |
241
+ | 5 | Selectolax | 80.57 | ~40.5x |
242
+ | 6 | BS4 with Lxml | 1541.37 | ~774.6x |
243
+ | 7 | MechanicalSoup | 1547.35 | ~777.6x |
244
+ | 8 | BS4 with html5lib | 3410.58 | ~1713.9x |
245
+
246
 
247
  ### 要素類似性とテキスト検索のパフォーマンス
248
 
249
  Scraplingの適応型要素検索機能は代替手段を大幅に上回ります:
250
 
251
+ | ライブラリ | 時間(ms) | vs Scrapling |
252
+ |-------------|:------:|:------------:|
253
+ | Scrapling | 2.46 | 1.0x |
254
+ | AutoScraper | 13.3 | 5.407x |
255
 
256
 
257
  > すべてのベンチマークは100回以上の実行の平均を表します。方法論については[benchmarks.py](https://github.com/D4Vinci/Scrapling/blob/main/benchmarks.py)を参照してください。
docs/README_RU.md CHANGED
@@ -232,25 +232,26 @@ Scrapling не только мощный - он также невероятно
232
 
233
  ### Тест скорости извлечения текста (5000 вложенных элементов)
234
 
235
- | # | Библиотека | Время (мс) | vs Scrapling |
236
- |---|:--------------------:|:----------:|:------------:|
237
- | 1 | Scrapling | 1.92 | 1.0x |
238
- | 2 | Parsel/Scrapy | 1.99 | 1.036x |
239
- | 3 | Raw Lxml | 2.33 | 1.214x |
240
- | 4 | PyQuery | 20.61 | ~11x |
241
- | 5 | Selectolax | 80.65 | ~42x |
242
- | 6 | BS4 с Lxml | 1283.21 | ~698x |
243
- | 7 | MechanicalSoup | 1304.57 | ~679x |
244
- | 8 | BS4 с html5lib | 3331.96 | ~1735x |
 
245
 
246
  ### Производительность подобия элементов и текстового поиска
247
 
248
  Возможности адаптивного поиска элементов Scrapling значительно превосходят альтернативы:
249
 
250
- | Библиотека | Время (мс) | vs Scrapling |
251
  |-------------|:----------:|:------------:|
252
- | Scrapling | 1.87 | 1.0x |
253
- | AutoScraper | 10.24 | 5.476x |
254
 
255
 
256
  > Все тесты производительности представляют собой средние значения более 100 запусков. См. [benchmarks.py](https://github.com/D4Vinci/Scrapling/blob/main/benchmarks.py) для методологии.
 
232
 
233
  ### Тест скорости извлечения текста (5000 вложенных элементов)
234
 
235
+ | # | Библиотека | Время (мс) | vs Scrapling |
236
+ |---|:-----------------:|:----------:|:------------:|
237
+ | 1 | Scrapling | 1.99 | 1.0x |
238
+ | 2 | Parsel/Scrapy | 2.01 | 1.01x |
239
+ | 3 | Raw Lxml | 2.5 | 1.256x |
240
+ | 4 | PyQuery | 22.93 | ~11.5x |
241
+ | 5 | Selectolax | 80.57 | ~40.5x |
242
+ | 6 | BS4 with Lxml | 1541.37 | ~774.6x |
243
+ | 7 | MechanicalSoup | 1547.35 | ~777.6x |
244
+ | 8 | BS4 with html5lib | 3410.58 | ~1713.9x |
245
+
246
 
247
  ### Производительность подобия элементов и текстового поиска
248
 
249
  Возможности адаптивного поиска элементов Scrapling значительно превосходят альтернативы:
250
 
251
+ | Библиотека | Время (мс) | vs Scrapling |
252
  |-------------|:----------:|:------------:|
253
+ | Scrapling | 2.46 | 1.0x |
254
+ | AutoScraper | 13.3 | 5.407x |
255
 
256
 
257
  > Все тесты производительности представляют собой средние значения более 100 запусков. См. [benchmarks.py](https://github.com/D4Vinci/Scrapling/blob/main/benchmarks.py) для методологии.
docs/benchmarks.md CHANGED
@@ -8,20 +8,20 @@ Scrapling isn't just powerful—it's also blazing fast, and the updates since ve
8
 
9
  | # | Library | Time (ms) | vs Scrapling |
10
  |---|:-----------------:|:---------:|:------------:|
11
- | 1 | Scrapling | 1.92 | 1.0x |
12
- | 2 | Parsel/Scrapy | 1.99 | 1.036x |
13
- | 3 | Raw Lxml | 2.33 | 1.214x |
14
- | 4 | PyQuery | 20.61 | ~11x |
15
- | 5 | Selectolax | 80.65 | ~42x |
16
- | 6 | BS4 with Lxml | 1283.21 | ~698x |
17
- | 7 | MechanicalSoup | 1304.57 | ~679x |
18
- | 8 | BS4 with html5lib | 3331.96 | ~1735x |
19
 
20
  ### Element Similarity & Text Search Performance
21
 
22
  Scrapling's adaptive element finding capabilities significantly outperform alternatives:
23
 
24
- | Library | Time (ms) | vs Scrapling |
25
  |-------------|:---------:|:------------:|
26
- | Scrapling | 1.87 | 1.0x |
27
- | AutoScraper | 10.24 | 5.476x |
 
8
 
9
  | # | Library | Time (ms) | vs Scrapling |
10
  |---|:-----------------:|:---------:|:------------:|
11
+ | 1 | Scrapling | 1.99 | 1.0x |
12
+ | 2 | Parsel/Scrapy | 2.01 | 1.01x |
13
+ | 3 | Raw Lxml | 2.5 | 1.256x |
14
+ | 4 | PyQuery | 22.93 | ~11.5x |
15
+ | 5 | Selectolax | 80.57 | ~40.5x |
16
+ | 6 | BS4 with Lxml | 1541.37 | ~774.6x |
17
+ | 7 | MechanicalSoup | 1547.35 | ~777.6x |
18
+ | 8 | BS4 with html5lib | 3410.58 | ~1713.9x |
19
 
20
  ### Element Similarity & Text Search Performance
21
 
22
  Scrapling's adaptive element finding capabilities significantly outperform alternatives:
23
 
24
+ | Library | Time (ms) | vs Scrapling |
25
  |-------------|:---------:|:------------:|
26
+ | Scrapling | 2.46 | 1.0x |
27
+ | AutoScraper | 13.3 | 5.407x |
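
The "vs Scrapling" column in the updated tables appears to be each library's time divided by Scrapling's time. The following is a minimal sketch of that arithmetic, not code taken from `benchmarks.py`: the helper name `relative_to` and the dictionary layout are assumptions made for illustration, and the timings are copied from the added rows in this commit.

```python
# Times (ms) copied from the added rows of the text-extraction table in this commit.
times_ms = {
    "Scrapling": 1.99,
    "Parsel/Scrapy": 2.01,
    "Raw Lxml": 2.5,
    "PyQuery": 22.93,
    "Selectolax": 80.57,
    "BS4 with Lxml": 1541.37,
    "MechanicalSoup": 1547.35,
    "BS4 with html5lib": 3410.58,
}


def relative_to(times, baseline="Scrapling"):
    """Hypothetical helper: divide every time by the baseline library's time."""
    base = times[baseline]
    return {name: t / base for name, t in times.items()}


ratios = relative_to(times_ms)

# Print one row per library: name, time, and slowdown relative to Scrapling.
for name, ratio in ratios.items():
    print(f"{name:<18} {times_ms[name]:>8} ms  {ratio:.3f}x")

# The same calculation for the element-similarity table.
similarity_ratios = relative_to({"Scrapling": 2.46, "AutoScraper": 13.3})
```

Rounding these ratios to the precision shown in the tables reproduces the added values (for example, 2.5 / 1.99 rounds to 1.256 and 13.3 / 2.46 rounds to 5.407).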