NOTE: This repository contains a patched version of uTidylib which includes all Debian patches and works with Mac OS X.
This is uTidylib, the Python wrapper for the HTML cleaning library named TidyLib. It supports both original Tidy <http://tidy.sf.net> and new HTML5 enabled Tidy <http://www.html-tidy.org/>.
Once installed, there are two ways to get help. The simplest is:
$ python
>>> import tidy
>>> help(tidy)
. . .
Then, of course, there's the API documentation, which is available at <http://utidylib.readthedocs.io/en/latest/>.
>>> import tidy
>>> print tidy.parseString(
... '<Html>Hello Tidy!',
... output_xhtml=1, add_xml_decl=1, indent=1, tidy_mark=0
... )
<?xml version="1.0" encoding="us-ascii"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN"
"http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<title></title>
</head>
<body>
Hello Tidy!
</body>
</html>
Good luck!