
This article present yet another HTML Parser based on the Microsoft XML Technology. The article will demonstrate basic parsing of HTML documents, including attribute enumeration, from a Console Application using C++. In addition, one can choose disk based files rather than 'in memory' pages since the later sample programs provide a FileStream class which inherits from IStream. Parsing of web based documents is demonstrated using URLOpenBlockingStream. Should the reader desire a Windows Application, please see Philip Patrick's Loading and Parsing HTML using MSHTML, Irek Zielinski's HTML Editor for VC++ 6.0 , or HTML Reader C++ Class Library. ..
|