Clone this repo:
  1. 9e79e1b Make package:html strong clean by Sigmund Cherem · 3 months ago master 0.13.0
  2. 9116929 Expand the version constraint to allow using csslib 0.13.x. by Bob Nystrom · 3 months ago 0.12.2+2
  3. 122292c re-publish package:html, but without the .packages file. Fixes #25262 by Sigmund Cherem · 7 months ago 0.12.2+1
  4. c3ef85f Prepare version for release by Sigmund Cherem · 11 months ago 0.12.2
  5. 7177493 Add Element.endSourceSpan by Konstantin Shcheglov · 11 months ago

html5 parser in dart

This is a pure Dart html5 parser. It‘s a port of html5lib from Python. Since it’s 100% Dart you can use it safely from a script or server side app.

Eventually the parse tree API will be compatible with dart:html, so the same code will work on the client and the server.

(Formerly known as html5lib.)

Installation

Add this to your pubspec.yaml (or create it):

dependencies:
  html: any

Then run the Pub Package Manager (comes with the Dart SDK):

pub install

Usage

Parsing HTML is easy!

import 'package:html/parser.dart' show parse;
import 'package:html/dom.dart';

main() {
  var document = parse(
      '<body>Hello world! <a href="www.html5rocks.com">HTML5 rocks!');
  print(document.outerHtml);
}

You can pass a String or list of bytes to parse. There's also parseFragment for parsing a document fragment, and HtmlParser if you want more low level control.

Running Tests

./test/run.sh