This is an agile HTML parser that builds a read/write DOM and supports plain XPATH or XSLT (you actually don't HAVE to understand XPATH nor XSLT to use it, don't worry...). It is a .NET code library that allows you to parse "out of the web" HTML files. The parser is very tolerant with "real world" malformed HTML. The object model is very similar to what proposes System.Xml, but for HTML documents (or streams).
Project Url
View on NuGet:



Installing with NuGet

PM> Install-Package HtmlAgilityPack

Packages that Depend on HtmlAgilityPack

PackageLatest VersionTags
AdvancedAgilityPack 1.1.0
AdvancedWebClient 0.0.1
aemarcoCore 1.2.1 aemarco
AFL 1.2.105
AFLUmbraco 1.2.105
AjaxControlToolkit.HtmlEditor.Sanitizer 18.1.0 Ajax Toolkit
Amesto.AutoConnect.Core 2.0.4 translations episerver
Animatronio 2.0.65 Selenium WebDriver
AppWebService.Consumer 1.33.0
Arachnophile 1.0 Web Script Extension Extension Method
ASI.SeleniumExtensions Selenium Integration testing
Atoms.PreCompiler 1.1.162
Awful 0.9.1
AwfulForumsLibrary Something Awful Forums Lowtax YOSPOS
Azuria 0.6 proxer
BeautifulWeb 1.0.5
BetterCms.Module.LuceneSearch 2.0.7
BetterCms.Module.Root 2.0.8
BFound.HtmlToMarkdown 0.0.5 markdown html converter
BingSearcher 1.1.0 bing search lazy url
BLogic.Shared.Modules.Selenium 1.0.5 page-object-pattern page object pattern selenium test automation
Boilerpipe.Net 1.2.0 text boilerplate content extraction
Boilerpipe.Net.Core 1.0.1 Boilerpipe .NET Core Text Extraction
Catpic 0.7.7 gadget opensocial social
Cavity.Data.Html HTML Data
Cep 2.0.3 cep correios endereco address brazil brazilian zip code codigo endereçamento postal
CERNSSO 2.1.0 CERN Security SSO
Cireson.Platform.Extension.WebUi 0.1.10 ciresonCPEX
ClientiDw9 4.3.0 Clienti DynamicWeb
cloudscribe.SimpleContent.Web 2.0.20 cloudscribe blog
Coco.Web 0.3.0
CodeForceLib 2.0.5
CodeHollow.FeedReader 1.1.1 feed rss atom
CodeHollow.FeedReader.Core 1.0.0
Coderwall.Models 1.0.0
Cofoundry.Plugins.DependencyInjection.Autofac 0.1.0 Cofoundry Autofac Plugin DependencyInjection DI
Cofoundry.Plugins.DependencyInjection.Autofac.Web 0.1.0 Cofoundry Autofac Plugin DependencyInjection DI
Cofoundry.Plugins.ImageResizing.ImageResizer 0.1.0 Cofoundry ImageResizing Plugin ImageResizer
Coli.Framework.ReceitaFederal 1.0.5 cpf cnpj consulta receita federal
com.kaizengineering.Library 1.0.5611.22252 Logging
com.staticvoidlabs.nugetlibs.gtools 1.0.1 Google Tools Search
com.staticvoidlabs.nugetlibs.orfparser 0.9.0 ORF epg iptv
CrawlerLib.Engine 2.3.5544.21265 crawler-lib task processor workflow action async await crawler spider bot scraper twitter facebook google+ blog http html xml json rss feed datamining IAsync tpl webrequest extracting
CrystalWind.Net.WebCrawler 2.1.1
CSF.Zpt.DocumentProviders.HtmlHAP 1.0.5
CsfdAPI 2018.1.20.1 CSFD API
CssInliner 0.1.1 Css Html Email
DataCollectorLib 1.0.3
DevelopmentHelpers.FileContentReader 2.0.2
DicioAPI 2.0.0 Dicio Meanings Synonyms Sinônimos Significados Dicionário pt-BR
DocuPanel 0.3.0 WPF Markdown
doG.Web 0.6.2 doG .Net Core Portable RAD Rapid Application Development Typescript .Net Framemwork Innovative
Domi-UserCtrl 2.1.0 Miscellanea-WPF-UserCtrl
dotNetRDF 2.0.1 RDF Semantic Web SPARQL RDF/XML Turtle Notation3
DotnetSpider2.Core 2.4.4 DotnetSpider crawler cross platform dotnet core
DotSee.NodeRestrict 1.0.1 umbraco publish nodes doctypes
dotSkoob 1.0.0 Skoob books
Dragonfly.Net 1.7.1
Drool 1.3.7 smtp sendgrid mailgun
DT.MailServer 1.0.0
EclassApi 1.1.10 eclass eclassmobileapi gr
Elision.Foundation 1.0.0 sitecore elision foundation
Elision.Foundation-redist 1.0.0 sitecore elision foundation
EPiSearch 0.1 EPiServer full text search FTS better
epvpapi 1.1.1
EVE.Mvc 0.6.4
ExchangeAPI 1.480.1094 Betfair API API-NG Pinnacle API
ExtentReports 3.1.1 reporting api
FacebookPromotion Facebook app promotion downloads Jennifer Marsman AppBar button FacebookAppPromotionButton FacebookPromotion
FantasyPremierLeagueApi.Api 1.1.0 Fantasy Football FantasyPremierLeague FPL FantasyFootball
FaviconLoader 1.0.4
FFXIVAPP.Common 4.0.4 ffxiv ffxivapp common helpers
FileCurator 2.0.2 FileSystem File URL
Fizzler.Systems.HtmlAgilityPack 1.1.1 selectors w3c htmlagility css html
FlakEssentials.Web 2017.831.65 FlakEssentials Web
FluentBootstrap.Mvc Bootstrap AspNet Mvc AspNetMvc
FluentSharp.HtmlAgilityPack 5.5.172 FluentSharp Fluent OWASP O2Platform Security
FluentSharp.HtmlAgilityPack.WinForms 5.5.172 FluentSharp Fluent OWASP O2Platform Security
foxsoftware.frameworks.datacollector 0.1.4 parse html web content
FrwSoftware.FrwSimpleWinCRUD 1.2.2 ObjectListView CRUD Json
Fue 1.5.1 FSharp Templating F# Templates
FunAIInc.BotKitty.BotPlugin 1.2.0 BotKitty
Generator 1.0.9
GeocachingToolbox geocaching opencaching
GitHubOAuth2Client-Redux GitHub OAuth OAuth2 DotNetOpenAuth
Golem 2.2.1 Selenium WebDriver Framework Golem ProtoTest Gallio MbUnit Rest White TestStack Test Testing
Gravity.Services.Comet 2018.3.17.1 automation crawling crawl selenium appium automation testing automation test qa gravity gravity api scrapping scrap
Gravity.Services.Core 2017.10.18.1 automation crawling crawl selenium appium automation testing automation test qa gravity gravity api scrapping scrap
Heatstat.Parser 1.0.4 heatstat baseball
Helpers.C3PO 2.0.4
Hext.dll 1.0.0 HtmlAgilityPack Html Agility Pack HTML AgilityPack HTMLAgilityPack HTMLAgility html agility pack htmlagility agilitypack htmlagilitypack Scraping .NET C# Parsing hext Hext