TEXT MINING
Text mining is made up of two words: “Text” & “Mining”. It literally means the extraction of large volume of text to find the relevant information. It is also called as text data mining [6] and knowledge discovery from word-based databases [5]. In simple words it refers to a process of hauling out thought-provoking and stimulating knowledge from unstructured text documents. Text mining is similar to data mining but it is an extended form of data mining. It leads to discovery of new knowledge from large volume of the existing unstructured data. Text is the most accepted form of storing information; hence text mining as a tool can be very useful for any organisation to take the advantages in this competitive age. The importance of text mining becomes even more when we find that the proportion of unstructured data is very high. Both text mining and data mining are usually used inter changeable, but when we compare both, we find that text mining is a much more intricate task, because it comprises of text data that are naturally unstructured and vague. Text mining is a multidisciplinary field. Text mining involves various processes and related to text analysis, information extraction, information retrieval, clustering, categorization, visualization, database technology, machine learning and data mining [2].