mirror of
https://github.com/Ponce/slackbuilds
synced 2024-11-24 10:02:29 +01:00
02e5a1c05d
Signed-off-by: Larry Hajali <larryhaja[at]gmail[dot]com>
12 lines
493 B
Text
12 lines
493 B
Text
html5lib (HTML parser based on the HTML5 specification)
|
|
|
|
HTML parser designed to follow the HTML5 specification. The parser is
|
|
designed to handle all flavours of HTML and parses invalid documents
|
|
using well-defined error handling rules compatible with the behaviour of
|
|
major desktop web browsers.
|
|
|
|
Output is to a tree structure; the current release supports output
|
|
to DOM, ElementTree and lxml tree formats as well as a simple
|
|
custom format.
|
|
|
|
Optional: datrie, python-chardet, lxml and genshi
|