Download of htmcleaner-2.20-src.zip (htmcleaner-2.20-src.zip ( external link: SF.net): 360,763 bytes) will begin shortly. If not so, click link on the left.
HtmlCleaner is HTML parser written in Java. It transforms dirty HTML to well-formed XML following the same rules that the most web-browsers use.