category: Python
requires: python2 python2-lxml python2-six python2-webencodings python2-chardet
sdesc: "Python WHATWG HTML parser"
ldesc: "html5lib is a pure-Python library for parsing HTML. It is designed to conform to the WHATWG HTML specification, as implemented by all major web browsers."
external-source: python-html5lib