![]() |
VOOZH | about |
dotnet add package ExcavatorSharp.WebScraper.x64 --version 1.2.8
NuGet\Install-Package ExcavatorSharp.WebScraper.x64 -Version 1.2.8
<PackageReference Include="ExcavatorSharp.WebScraper.x64" Version="1.2.8" />
<PackageVersion Include="ExcavatorSharp.WebScraper.x64" Version="1.2.8" />Directory.Packages.props
<PackageReference Include="ExcavatorSharp.WebScraper.x64" />Project file
paket add ExcavatorSharp.WebScraper.x64 --version 1.2.8
#r "nuget: ExcavatorSharp.WebScraper.x64, 1.2.8"
#:package ExcavatorSharp.WebScraper.x64@1.2.8
#addin nuget:?package=ExcavatorSharp.WebScraper.x64&version=1.2.8Install as a Cake Addin
#tool nuget:?package=ExcavatorSharp.WebScraper.x64&version=1.2.8Install as a Cake Tool
ExcavatorSharp is a multi-threaded server for scraping web data. It converts HTML code into a structured array of data. The library allows data scraping from multiple sites in parallel mode, within a single running application. Create scraping tasks and perform data extraction on a schedule.
The library is designed for professional extraction and parsing of large volumes of data. Under the hood there are .css-selectors and xpath support, data export into .csv/.xlsx/.sql/.json, online data export, support for proxy servers, dynamic content crawling, interaction with the site via javascript and much more. The library uses .NET Sockets and Chromium Embedded Framework.
The library can be used separately as crawler or parser. We support the formats sitemap.xml and robots.txt. We support the gzip / deflate compression.
Attention! Only x64 versions are supported for .NET 4.5.2 and 4.6 platforms. AnyCPU build does not support! You will NOT be able to run the library when building AnyCPU. This is caused by the features of CEF.
| Product | Versions Compatible and additional computed target framework versions. |
|---|---|
| .NET Framework | net452 net452 is compatible. net46 net46 is compatible. net461 net461 was computed. net462 net462 was computed. net463 net463 was computed. net47 net47 was computed. net471 net471 was computed. net472 net472 was computed. net48 net48 was computed. net481 net481 was computed. |
This package is not used by any NuGet packages.
This package is not used by any popular GitHub repositories.
| Version | Downloads | Last Updated |
|---|---|---|
| 1.2.8 | 1,113 | 8/10/2020 |
| 1.2.7 | 678 | 8/10/2020 |
| 1.2.3 | 748 | 5/20/2020 |
| 1.2.2 | 705 | 5/10/2020 |
| 1.2.1 | 724 | 5/5/2020 |
| 1.2.0 | 748 | 4/30/2020 |
| 1.1.0 | 708 | 4/23/2020 |
| 1.0.53 | 688 | 4/12/2020 |
| 1.0.52 | 714 | 4/11/2020 |
| 1.0.51 | 710 | 4/11/2020 |
| 1.0.6 | 699 | 4/23/2020 |
| 1.0.5 | 713 | 4/11/2020 |
| 1.0.4 | 753 | 4/3/2020 |
| 1.0.3 | 730 | 2/12/2020 |
| 1.0.2 | 748 | 1/30/2020 |
| 1.0.1 | 729 | 1/30/2020 |
| 1.0.0 | 677 | 1/23/2020 |
1) Added ability to extract data from iframe blocks
2) Added possibility to take a screenshot in the project testing mode
3) Fixed current errors and increased productivity