The online enquiries of users, no matter whether it focuses on Web database or texts on web pages, are basically interactive, that is, the request for enquiry is input on the interface of specific search websites, hyperlinks meeting the requirements are shown on the pages or data contents are extracted, and sheets are established.
The purpose that users make enquiries certainly is to obtain useful data. In Web data mining, if sequential characteristic is taken as main line, with the idea from semi-structuring to structuring, from structuring to association rule clustering and knowledge discovery, from association rule clustering and knowledge discovery to the expression of data structuring, the display of hyperlinks is sorted according to the level of effectiveness, so that retrieval is more effective.
If structured information expresses the primitive characters generated by information dominantly, semistructured
information includes the nature of information recessively, and implies more key contents which can be used.
Most information and knowledge lie in "semistructuring". The management of semi-structured information and knowledge will make the efficiency of Web data mining improved. It is a piece of very significant work.