Tech Tutorials Database
GeekArticles Web Programming Python
 

Wrestling HTML

 
Author: xml.com
Category: Python
Comments (0)

by Uche Ogbuji September 08, 2004 Lately I've seen HTML parsing problems everywhere. One project needed a web crawler with specialized features provided through Python code that processed arbitrary HTML. There have also been several threads on mailing lists I frequent (including XML-SIG ) featuring discussions of mechanisms for dealing with broken HTML by converting it to decent XHTML. This article focuses on Python APIs for converting good or bad HTML to XML. Based on glowing...n

Read More...




Sponsored Links




Read Next: Look Ma, No Tags



 

 

Comments



Post Your Comment:

Your Name:*
e-mail ID:(required for notification)*
Image Verification: 
 
 Subscribe