FRAMEWORK OF TEXT MINING
Text mining structure comprises of two modules: A. Text refining Text refining converts unstructured text documents into an intermediate form. B. Knowledge distillation In this process, Knowledge is gathered from intermediate form, which is obtained by text refining. The route by which the text mining works is that it transforms unstructured documents into Intermediate form (IF). The IF can be framed by two methods. They are: • Document based IF • Concept based IF Once information is converted into IF, knowledge distillation practice is executed that helps in transforming document based IF into clustered, categorized and virtualized information as well as concept based IF into predictive modelling, associate discovery or virtualized information IV. PROCESS OF TEXT MINING There are various steps involved in the process of Text Mining. The important steps are: A. Text Interpretation Text interpretation is used to make the information useful. This is the initial step and other steps of text mining are based on it. B. Text organisation: In this step the “ text pieces” obtained from the text interpretation are organised into some useful pattern. This leads to the formation of usable grid. Through these usable grids, knowledge discovery takes place by building taxonomies and ontologies. C. Format Format of the result is an important aspect. The format of the result output should be self-explanatory and denotes the discovery of knowledge