python3-html5-parser 0.4.9-1 (i386 binary) in ubuntu focal
A fast implementation of the HTML 5 parsing spec for Python. Parsing is
done in C using a variant of the gumbo parser. The gumbo parse tree is
then transformed into an lxml tree, also in C, yielding parse times that
can be a thirtieth of the html5lib parse times. That is a speedup of 30x.
This differs, for instance, from the gumbo python bindings, where the
initial parsing is done in C but the transformation into the final
tree is done in python.
Details
- Package version:
- 0.4.9-1
- Status:
- Deleted
- Component:
- universe
- Priority:
- Optional
Downloadable files
i386 build of html5-parser 0.4.9-1 in ubuntu focal PROPOSED produced
these files:
- python3-html5-parser_0.4.9-1_i386.deb (132.0 KiB)