python3-html5-parser 0.4.10-5 (amd64 binary) in ubuntu kinetic
A fast implementation of the HTML 5 parsing spec for Python. Parsing is
done in C using a variant of the gumbo parser. The gumbo parse tree is
then transformed into an lxml tree, also in C, yielding parse times that
can be a thirtieth of the html5lib parse times. That is a speedup of 30x.
This differs, for instance, from the gumbo Python bindings, where the
initial parsing is done in C but the transformation into the final
tree is done in Python.
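As a minimal sketch of what the description above means in practice, the snippet below parses a small HTML string and reads the title from the resulting lxml tree. It assumes the package is installed and importable as html5_parser, and that parse() is called with its default settings so that it returns the root element of an lxml tree. The helper name page_title and the sample markup are hypothetical, chosen only for illustration.

```python
import importlib.util

# Assumption: the package is importable as "html5_parser". This flag only
# records whether it is available in the current environment.
HAVE_HTML5_PARSER = importlib.util.find_spec("html5_parser") is not None


def page_title(html):
    """Return the text of the <title> element, or None if there is none.

    Hypothetical helper for illustration; it is not part of the package.
    """
    # Imported lazily so this module can be loaded without the package.
    from html5_parser import parse

    # With default arguments, parse() is assumed to return the root <html>
    # element of an lxml tree, so ordinary lxml lookups work on the result.
    root = parse(html)
    return root.findtext(".//title")


# Sample input with unclosed tags, which an HTML 5 parser is expected to repair.
SAMPLE = "<!DOCTYPE html><title>Demo</title><p>Hello <b>world"

if __name__ == "__main__" and HAVE_HTML5_PARSER:
    print(page_title(SAMPLE))
```

Because the result is an ordinary lxml element, the usual lxml calls such as xpath(), iter() and lxml.etree.tostring() can be applied to it without any further conversion step.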
Details
- Package version: 0.4.10-5
- Status: Published
- Component: universe
- Priority: Optional
Downloadable files
amd64 build of html5-parser 0.4.10-5 in ubuntu kinetic PROPOSED produced
these files:
- python3-html5-parser_0.4.10-5_amd64.deb (144.1 KiB)
Package relationships
- Provides: python3.10-html5-parser