How can I get the contents of an html or shtml file with C/C++?

icu

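Ordinary HTML and SHTML are not formatted like XML, so I don't think parsing them the way I would parse XML will work, and writing a parser seems too complicated. Is there a simple way, in C/C++ or through some API, to directly get one region of an HTML document, for example just the body?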

malaga

Try looking for open-source code.

For example, Mozilla's Gecko and the like.

donglele2005

I don't know. I've only learned two of these and never thought about this problem. Could some expert help out? We'd all really like to know.

yuanyan

Could someone who has actually done this weigh in...

zhongyinyuan

IWebBrowser2

yangtianle

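Stop thinking about it.
Write it yourself.
Programmers shouldn't be afraid of hard work. We even wrote our own code for reading XML files.

There are benefits to doing it this way. The downside, of course, is the time it costs.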
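
As a rough illustration of the "write it yourself" route for the case asked about at the top of the thread (pulling out just the body), here is a minimal sketch using only the C++ standard library. The helper names `find_ci` and `extract_element` are made up for this example. It is a plain string scan, not a parser: it assumes no `>` inside attribute values, ignores comments and scripts, and does not handle nested elements of the same name.

```cpp
#include <algorithm>
#include <cctype>
#include <cstddef>
#include <string>

// Hypothetical helper: case-insensitive search for `needle` in `hay`,
// starting at offset `from`. Returns std::string::npos if not found.
static std::size_t find_ci(const std::string& hay, const std::string& needle,
                           std::size_t from = 0) {
    if (from > hay.size()) return std::string::npos;
    std::string::const_iterator first =
        hay.begin() + static_cast<std::string::difference_type>(from);
    std::string::const_iterator it = std::search(
        first, hay.end(), needle.begin(), needle.end(),
        [](unsigned char a, unsigned char b) {
            return std::tolower(a) == std::tolower(b);
        });
    if (it == hay.end()) return std::string::npos;
    return static_cast<std::size_t>(it - hay.begin());
}

// Hypothetical helper: copies the text between <tag ...> and </tag> into
// `out`. Returns false if no opening tag with that name is found.
bool extract_element(const std::string& html, const std::string& tag,
                     std::string& out) {
    const std::string open = "<" + tag;
    const std::string close = "</" + tag;
    std::size_t pos = 0;
    for (;;) {
        std::size_t start = find_ci(html, open, pos);
        if (start == std::string::npos) return false;
        std::size_t after = start + open.size();
        // The tag name must end here; otherwise "<body" would also
        // match longer names such as "<bodyguard>".
        if (after < html.size() &&
            (html[after] == '>' || html[after] == '/' ||
             std::isspace(static_cast<unsigned char>(html[after])))) {
            std::size_t gt = html.find('>', after);
            if (gt == std::string::npos) return false;
            std::size_t end = find_ci(html, close, gt + 1);
            if (end == std::string::npos) {
                // HTML allows some end tags (e.g. </body>) to be omitted,
                // so fall back to the rest of the document.
                out = html.substr(gt + 1);
            } else {
                out = html.substr(gt + 1, end - gt - 1);
            }
            return true;
        }
        pos = after;  // false match; keep scanning
    }
}
```

Calling `extract_element(page, "body", out)` on a page loaded into a `std::string` fills `out` with everything between the opening and closing body tags. For anything beyond this simple case, a real HTML parser is the safer choice.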

bfan

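For reading and writing XML there is ready-made code to refer to, so of course writing your own doesn't take much time. For HTML there is no ready-made parsing code.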

fgaggag

There should be open-source code you can borrow from for parsing HTML. I did a quick search for the original poster and found these:

Steev's HTML Parser
Steev's HTML Parser is an HTML parsing library that builds a complete hierarchy for each element and attribute in the supplied HTML file. Each element is its own C++ class, replete with child nodes, allowing for full control and processing. An 'HTML beautifier' example is included.
URL: http://freshmeat.net/projects/steevshtmlparser/

htmlcxx
htmlcxx is a simple non-validating CSS1 and HTML parser for C++. The parsing politics attempt to mimic the behavior of Mozilla Firefox, so you should expect parse trees similar to those created by Firefox. However, it does not insert nonexistent stuff in your HTML. Therefore, serializing the DOM tree gives exactly the same output as the original HTML document. Another key feature is an STL-like tree navigation API provided by the tree.hh template library.
URL: http://freshmeat.net/projects/htmlcxx/

Xport toolkit
Xport is a C++ template class library that can be included in any C++ project to enable the creation and generation of XHTML documents. Although it was developed with the idea of creating XHTML documents for reporting purposes, Xport can be used to create XHTML documents for many other uses as well. It can easily generate and parse (X)HTML documents and stylesheets. It is intuitive to use, and allows many options for parsing and generating documents.
URL: http://freshmeat.net/projects/xporttoolkit/

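I found these by searching the freshmeat site for the keywords "html parse". All three are open source, and the last one looks quite good and powerful. I hope they are useful to you.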

leijun203

You'd need an HTML parser for this.

netflu

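I'll just have to keep looking. Open-source code can't necessarily be ported to CE. Thanks for the leads. That said, parsing the HTML is only the first step; intelligently extracting HTML elements is the key part. It would be great if there were a ready-made algorithm.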

lijunjie307

Following this thread to learn.
