In order to be successful, text mining studies should follow a sound methodology based on best practices.
A standardized process model is needed similar to CRISP-DM, which is the industry standard for data mining projects (see Chapter 5).
Even though most parts of CRISP-DM are also applicable to text mining projects, a specific process model for text mining would include mush more elaborate data preprocessing activities.
Figure 7.4 depicts a high-level context diagram of a typical text mining process (Delen and Crossland, 2008).