We propose a data mining approach to predict the success of telemarketing calls for selling bank long-term
deposits. A Portuguese retail bank was addressed, with data collected from 2008 to 2013, thus including the effects
of the recent financial crisis. We analyzed a large set of 150 features related with bank client, product and
social-economic attributes. A semi-automatic feature selection was explored in the modeling phase, performed
with the data prior to July 2012 and that allowed to select a reduced set of 22 features. We also compared four
Data Mining models: logistic regression, decision trees, neural network and support vector machine. Using
two metrics, area of the receiver operating characteristic curve and area of the cumulative curve
, the four models were tested on an evaluation set, using the most recent data (after July 2012) and a
rolling window scheme. The NN presented the best results, allowing to reach 79% of the subscribers by selecting the half better classified clients. Also, two knowledge extraction methods, a sensitivity analysis, were applied to the NN model and revealed several key attributes. Such knowledge extraction confirmed the obtained
model as credible and valuable for telemarketing campaign managers.
We propose a data mining approach to predict the success of telemarketing calls for selling bank long-termdeposits. A Portuguese retail bank was addressed, with data collected from 2008 to 2013, thus including the effectsof the recent financial crisis. We analyzed a large set of 150 features related with bank client, product andsocial-economic attributes. A semi-automatic feature selection was explored in the modeling phase, performedwith the data prior to July 2012 and that allowed to select a reduced set of 22 features. We also compared fourData Mining models: logistic regression, decision trees, neural network and support vector machine. Usingtwo metrics, area of the receiver operating characteristic curve and area of the cumulative curve, the four models were tested on an evaluation set, using the most recent data (after July 2012) and arolling window scheme. The NN presented the best results, allowing to reach 79% of the subscribers by selecting the half better classified clients. Also, two knowledge extraction methods, a sensitivity analysis, were applied to the NN model and revealed several key attributes. Such knowledge extraction confirmed the obtainedmodel as credible and valuable for telemarketing campaign managers.
การแปล กรุณารอสักครู่..