GeekArticles
Web Programming
Python
Web Programming
PythonWrestling HTML
by Uche Ogbuji September 08, 2004 Lately I've seen HTML parsing problems everywhere. One project needed a web crawler with specialized features provided through Python code that processed arbitrary HTML. There have also been several threads on mailing lists I frequent (including XML-SIG ) featuring discussions of mechanisms for dealing with broken HTML by converting it to decent XHTML. This article focuses on Python APIs for converting good or bad HTML to XML. Based on glowing...n
Sponsored Links
Read Next: Look Ma, No Tags
Related Topics
Subscribe via RSS
Python
- Lonely Python
- Python on the Side
- The Python Pattern
- The IPC10 Python Gathering
- What's New in Python 2.3?
- Develop Python/XML with 4Suite, Part 5: The repository features
- The State of the Python-XML Art
- Python and XML development using 4Suite, Part 3: 4RDF
- The State of the Python-XML Art, 2003
- Learn to Program using Python: Lesson 1, Getting Started
