method based on DOM tree analysis, and it can extract the
content of the page from a global view of the page not a local
view to some extent. It makes full use of the webpage layout
information, and guides the process of content extraction. By
recalling the sentences which the traditional method throws
away, it improves the performance of the traditional method
greatly.