![]() |
VOOZH | about |
dotnet add package Devlooped.Web --version 1.4.0
NuGet\Install-Package Devlooped.Web -Version 1.4.0
<PackageReference Include="Devlooped.Web" Version="1.4.0" />
<PackageVersion Include="Devlooped.Web" Version="1.4.0" />Directory.Packages.props
<PackageReference Include="Devlooped.Web" />Project file
paket add Devlooped.Web --version 1.4.0
#r "nuget: Devlooped.Web, 1.4.0"
#:package Devlooped.Web@1.4.0
#addin nuget:?package=Devlooped.Web&version=1.4.0Install as a Cake Addin
#tool nuget:?package=Devlooped.Web&version=1.4.0Install as a Cake Tool
Read HTML as XML and query it with CSS over XLinq (or HtmlAgilityPack killer 😉).
Provides HtmlDocument.Load and CssSelectElement(s) extension methods
for XDocument/XElement.
No need to learn an entirely new object model for a page 🤘. This makes it the most productive and lean library for web scraping using the latest and greatest that .NET can offer.
using System.Xml.Linq;
using Devlooped.Web;
XDocument page = HtmlDocument.Load("page.html")
IEnumerable<XElement> elements = page.CssSelectElements("div.menuitem");
XElement title = page.CssSelectElement("html head meta[name=title]");
By default, HtmlDocument.Load will skip non-content elements script and
style, turn all element names into lower case, and ignore all XML namespaces
(useful when loading XHTML, for example) for easier querying. These options
as well as granular whitespace handling can be configured using the overloads
receiving an HtmlReaderSettings.
The underlying parsing is performed by the amazing SgmlReader library by Microsoft's Chris Lovett.
In addition, the following extension methods make it easier to work with XML documents where you want to query with CSS or XPath without having to deal with XML namespaces:
using System.Xml;
using System.Xml.Linq;
using Devlooped.Web;
var doc = XDocument.Load("doc.xml")
// Will remove all xmlns declarations, and allow querying elements
// as if none had namespaces, returns the root element
XElement nons = doc.RemoveNamespaces();
// Alternatively, you can also ignore at the XmlReader level
using var reader = XmlReader.Create("doc.xml").IgnoreNamespaces();
doc = XDocument.Load(reader);
// Finally, you can also skip elements at the reader level
using var reader = XmlReader.Create("doc.xml").SkipElements("foo", "bar");
doc = XDocument.Load(reader);
At the moment, supports the following CSS selector features:
And all combinators
Non-CSS features:
text() pseudo-attribute selector: selects the node text contents, as specified
in the XPath text() location
path. Can be used instead of an attribute name selector, such as div[text()=foo].
All attribute value selectors
are also supported:
[text()=val]: Represents an element whose text contents is exactly "val".[text()~=val]: Represents an element whose text contents is a whitespace-separated list of words,
one of which is exactly "val". If "val" contains whitespace, it will never represent anything (since the words
are separated by spaces). Also if "val" is the empty string, it will never represent anything.[text()|=val]: Represents an element whose text contents either being exactly "val" or
beginning with "val" immediately followed by "-" (U+002D).[text()^=val]: Represents an element whose text contents begins with the prefix "val".
If "val" is the empty string then the selector does not represent anything.[text()$=val]: Represents an element whose text contents ends with the suffix "val".
If "val" is the empty string then the selector does not represent anything.[text()*=val]: Represents an element whose text contents contains at least one instance of the
substring "val". If "val" is the empty string then the selector does not represent anything.To ensure the long-term sustainability of this project, users of this package who generate revenue must pay an Open Source Maintenance Fee. While the source code is freely available under the terms of the , this package and other aspects of the project require .
To pay the Maintenance Fee, become a Sponsor at the proper OSMF tier. A single fee covers all of Devlooped packages.
👁 Clarius Org
👁 MFB Technologies, Inc.
👁 SandRock
👁 DRIVE.NET, Inc.
👁 Keith Pickford
👁 Thomas Bolon
👁 Kori Francis
👁 Uno Platform
👁 Reuben Swartz
👁 Jacob Foshee
👁 alternate text is missing from this package README image
👁 Eric Johnson
👁 Jonathan
👁 Ken Bonny
👁 Simon Cropp
👁 agileworks-eu
👁 Zheyu Shen
👁 Vezel
👁 ChilliCream
👁 4OTC
👁 domischell
👁 Adrian Alonso
👁 Michael Hagedorn
👁 torutek
👁 mccaffers
👁 Seika Logiciel
👁 Andrew Grant
| Product | Versions Compatible and additional computed target framework versions. |
|---|---|
| .NET | net5.0 net5.0 was computed. net5.0-windows net5.0-windows was computed. net6.0 net6.0 was computed. net6.0-android net6.0-android was computed. net6.0-ios net6.0-ios was computed. net6.0-maccatalyst net6.0-maccatalyst was computed. net6.0-macos net6.0-macos was computed. net6.0-tvos net6.0-tvos was computed. net6.0-windows net6.0-windows was computed. net7.0 net7.0 was computed. net7.0-android net7.0-android was computed. net7.0-ios net7.0-ios was computed. net7.0-maccatalyst net7.0-maccatalyst was computed. net7.0-macos net7.0-macos was computed. net7.0-tvos net7.0-tvos was computed. net7.0-windows net7.0-windows was computed. net8.0 net8.0 was computed. net8.0-android net8.0-android was computed. net8.0-browser net8.0-browser was computed. net8.0-ios net8.0-ios was computed. net8.0-maccatalyst net8.0-maccatalyst was computed. net8.0-macos net8.0-macos was computed. net8.0-tvos net8.0-tvos was computed. net8.0-windows net8.0-windows was computed. net9.0 net9.0 was computed. net9.0-android net9.0-android was computed. net9.0-browser net9.0-browser was computed. net9.0-ios net9.0-ios was computed. net9.0-maccatalyst net9.0-maccatalyst was computed. net9.0-macos net9.0-macos was computed. net9.0-tvos net9.0-tvos was computed. net9.0-windows net9.0-windows was computed. net10.0 net10.0 was computed. net10.0-android net10.0-android was computed. net10.0-browser net10.0-browser was computed. net10.0-ios net10.0-ios was computed. net10.0-maccatalyst net10.0-maccatalyst was computed. net10.0-macos net10.0-macos was computed. net10.0-tvos net10.0-tvos was computed. net10.0-windows net10.0-windows was computed. |
| .NET Core | netcoreapp2.0 netcoreapp2.0 was computed. netcoreapp2.1 netcoreapp2.1 was computed. netcoreapp2.2 netcoreapp2.2 was computed. netcoreapp3.0 netcoreapp3.0 was computed. netcoreapp3.1 netcoreapp3.1 was computed. |
| .NET Standard | netstandard2.0 netstandard2.0 is compatible. netstandard2.1 netstandard2.1 was computed. |
| .NET Framework | net461 net461 was computed. net462 net462 was computed. net463 net463 was computed. net47 net47 was computed. net471 net471 was computed. net472 net472 was computed. net48 net48 was computed. net481 net481 was computed. |
| MonoAndroid | monoandroid monoandroid was computed. |
| MonoMac | monomac monomac was computed. |
| MonoTouch | monotouch monotouch was computed. |
| Tizen | tizen40 tizen40 was computed. tizen60 tizen60 was computed. |
| Xamarin.iOS | xamarinios xamarinios was computed. |
| Xamarin.Mac | xamarinmac xamarinmac was computed. |
| Xamarin.TVOS | xamarintvos xamarintvos was computed. |
| Xamarin.WatchOS | xamarinwatchos xamarinwatchos was computed. |
Showing the top 4 NuGet packages that depend on Devlooped.Web:
| Package | Downloads |
|---|---|
|
Devlooped.Xml.Css
Superseded by Devlooped.Web |
|
|
Devlooped.Html
Superseded by Devlooped.Web |
|
|
Devlooped.Sponsors.Commands
Commands used by the dotnet-sponsor CLI to manage sponsorship manifests. |
|
|
Devlooped.Epub
Lightweight read-only API for processing EPUB documents. |
This package is not used by any popular GitHub repositories.