Box 1.2 Model selection and inference
Inference from models can take many forms, some of which are
misleading. For example, collection of large amounts of data as fodder
for multivariate models without a clear purpose can lead to
spurious results (Rexstad et al. 1988; Anderson et al. 2001). A relatively
new wave of model selection and inference, however, is based
on information theoretic approaches. Burnham and Anderson
(1998:1) describe this as “making valid inferences from scientific
data when a meaningful analysis depends on a model.” This approach
is based on the concept that the data, no matter how large
the data set, will only support limited inference. Thus, a proper
model has: (1) the full support of the data, (2) enough parameters
to avoid bias, and (3) not too many parameters (so that precision is
not lost). The latter two criteria combine to form the “Principle of
Parsimony” (Burnham and Anderson 1992): a trade off between the
extremes of underfitting (not enough parameters) and overfitting
(too many parameters) the model, given a set of a priori alternative
models for the analysis of a given data set.
Box 1.2 Model selection and inference
Inference from models can take many forms, some of which are
misleading. For example, collection of large amounts of data as fodder
for multivariate models without a clear purpose can lead to
spurious results (Rexstad et al. 1988; Anderson et al. 2001). A relatively
new wave of model selection and inference, however, is based
on information theoretic approaches. Burnham and Anderson
(1998:1) describe this as “making valid inferences from scientific
data when a meaningful analysis depends on a model.” This approach
is based on the concept that the data, no matter how large
the data set, will only support limited inference. Thus, a proper
model has: (1) the full support of the data, (2) enough parameters
to avoid bias, and (3) not too many parameters (so that precision is
not lost). The latter two criteria combine to form the “Principle of
Parsimony” (Burnham and Anderson 1992): a trade off between the
extremes of underfitting (not enough parameters) and overfitting
(too many parameters) the model, given a set of a priori alternative
models for the analysis of a given data set.
การแปล กรุณารอสักครู่..
