Data Mining is the nontrivial process of identifying valid, novel, potentially useful and ultimately
understandable pattern in data with the wide use of databases and the explosive growth in their sizes. Data
mining refers to extracting or “mining” knowledge from large amounts of data. Data mining is the search for the
relationships and global patterns that exist in large databases but are hidden among large amounts of data. The
essential process of Knowledge Discovery is the conversion of data into knowledge in order to aid in decision
making, referred to as data mining. Knowledge Discovery process consists of an iterative sequence of data
cleaning, data integration, data selection, data mining pattern recognition and knowledge presentation. Data
mining is the search for the relationships and global patterns that exist in large databases bur are hidden among
large amounts of data.
Many hospital information systems are designed to support patient billing, inventory management and
generation of simple statistics. Some hospitals use decision support systems, but are largely limited. They can
answer simple queries like “What is the average age of patients who have heart disease?” , “How many
surgeries had resulted in hospital stays longer than 10 days?”, “Identify the female patients who are single,
above 30 years old, and who have been treated for cancer.” However they cannot answer complex queries like
“Given patient records, predict the probability of patients getting a heart disease.” Clinical decisions are often
made based on doctors’ intuition and experience rather than on the knowledge rich data hidden in the database.
This practice leads to unwanted biases, errors and excessive medical costs which affects the quality of service
provided to patients. The proposed system that integration of clinical decision support with computer-based
patient records could reduce medical errors, enhance patient safety, decrease unwanted practice variation, and
improve patient outcome. This suggestion is promising as data modeling and analysis tools, e.g., data mining,
have the potential to generate a knowledge rich environment which can help to significantly improve the quality
of clinical decisions.