this paper
is to highlight the important techniques and methodologies
employed in text document classification, while at
the same time raising awareness of some of the interesting
challenges that remain to be solved, focusing mainly on text
representation and machine learning techniques. This paper
provides a review of the theory and methods of document
classification and text mining, focusing on the existing literature.