htmlcxx Publisher's description
from Tosin Komolafe
A C++ library to help you with your work.
htmlcxx is a simple non-validating html parser library for C++ designed to allow users to fully dump the original html document, character by character, from the parse tree. The library also has an intuitive tree traversal API.
Here are some key features of "htmlcxx":
В· STL like navigation of DOM tree, using excelent's tree.hh library
В· It is possible to reproduce exactly, character by character, the original document from the parse tree
В· Bundled css parser
В· Optional parsing of attributes
В· C++ code that looks like C++ (not so true anymore)
В· Offsets of tags/elements in the original document are stored in the nodes of the DOM tree
System Requirements:No special requirements.
Program Release Status:
Program Install Support: Install And Uninstall