HTML/XML Parser for Python
Maintained by: mwm@mired.org
Port Added: 25 Jul 2012 16:58:50
Also Listed In: python
License: not specified in port
Beautiful Soup parses arbitrarily invalid XML- or HTML-like substance
into a tree representation. It provides methods and Pythonic idioms
that make it easy to search and modify the tree.
A well-formed XML/HTML document will yield a well-formed data
structure. An ill-formed XML/HTML document will yield a
correspondingly ill-formed data structure. If your document is only
locally well-formed, you can use this library to find and process the
well-formed part of it. The BeautifulSoup class has heuristics for
obtaining a sensible parse tree in the face of common HTML errors.
WWW: http://www.crummy.com/software/BeautifulSoup/
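As an illustration of the search-and-modify idioms described above, here is a minimal sketch against the 4.x branch of the library. It assumes the package is importable as `bs4` and uses the `html.parser` tree builder from the Python standard library; the markup, the URLs, and the variable names are invented for the example and do not come from the port.

```python
from bs4 import BeautifulSoup

# Deliberately ill-formed markup: neither <p> is closed, and </html> is missing.
markup = (
    "<html><body>"
    "<p>First paragraph"
    "<p>Second paragraph with a <a href='http://example.org/'>link</a>"
    "</body>"
)

# Build a parse tree with the standard-library tree builder.
soup = BeautifulSoup(markup, "html.parser")

# Search the tree: collect the href attribute of every <a> element.
links = [a["href"] for a in soup.find_all("a")]

# Modify the tree: point the first <a> element at a different (made-up) URL.
first_link = soup.find("a")
first_link["href"] = "http://example.org/new"

print(links)
print(first_link["href"])
```

The exact nesting of the unclosed `<p>` elements depends on which tree builder is used, so the sketch only relies on the `<a>` element, which any builder should recover.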
- copy py-beautifulsoup 3.2.0 to py-beautifulsoup32
- update py-beautifulsoup to current stable branch (4.1.1)
- pass maintainership of this new port to submitter
- set USE_PYTHON to 2.7, because it doesn't package with python3
(port modification needed)
- chase dependency update to deskutils/calibre
- add UPDATING entry
PR: 168372 (based on)
Submitted by: William Grzybowski <william88 at gmail dot com>
Approved by: Mike Meyer <mwm at mired dot org> (maintainer)