In our view, KDD refers to the overall process
of discovering useful knowledge from data,
and data mining refers to a particular step
in this process. Data mining is the application
of specific algorithms for extracting patterns
from data. The distinction between the KDD
process and the data-mining step (within the
process) is a central point of this article. The
additional steps in the KDD process, such as
data preparation, data selection, data cleaning,
incorporation of appropriate prior knowledge,
and proper interpretation of the results of
mining, are essential to ensure that useful
knowledge is derived from the data. Blind application
of data-mining methods (rightly criticized
as data dredging in the statistical literature)
can be a dangerous activity, easily
leading to the discovery of meaningless and
invalid patterns.
associations, changes, anomalies and signi®cant structures
from large amounts of data stored in databases,
data warehouses, or other information repositories. It
can be used to help companies to make better decision
to stay competitive in the marketplace. The major data
mining functions that are developed in commercial
and research communities include summarization,
association, classi®cation, prediction and clustering.
These functions can be implemented using a variety of
technologies,
In our view, KDD refers to the overall processof discovering useful knowledge from data,and data mining refers to a particular stepin this process. Data mining is the applicationof specific algorithms for extracting patternsfrom data. The distinction between the KDDprocess and the data-mining step (within theprocess) is a central point of this article. Theadditional steps in the KDD process, such asdata preparation, data selection, data cleaning,incorporation of appropriate prior knowledge,and proper interpretation of the results ofmining, are essential to ensure that usefulknowledge is derived from the data. Blind applicationof data-mining methods (rightly criticizedas data dredging in the statistical literature)can be a dangerous activity, easilyleading to the discovery of meaningless andinvalid patterns. associations, changes, anomalies and signi®cant structuresfrom large amounts of data stored in databases,data warehouses, or other information repositories. Itcan be used to help companies to make better decisionto stay competitive in the marketplace. The major datamining functions that are developed in commercialand research communities include summarization,association, classi®cation, prediction and clustering.These functions can be implemented using a variety oftechnologies,
การแปล กรุณารอสักครู่..