Download of htmlcleaner-2.6-src.zip (htmlcleaner-2.6-src.zip ( external link: SF.net): 505,076 bytes) will begin shortly. If not so, click link on the left.
HtmlCleaner is HTML parser written in Java. It transforms dirty HTML to well-formed XML following the same rules that the most web-browsers use.