small percent of the customers in the datasets are already buyers or responders, and we need to discover likely buyers from the current non-buyers. Due to confidentiality of the datasets, we can only give very a brief description. The first dataset is for a loan product promotion from a major bank in Canada. It has about 90,000 customers in total, and only 1.2 % are responders. Each customer is described by 55 attributes, about half are discrete and the other half numerical. After transforming some attributes (such as the area code), a total of 62 attributes are used for the data mining purpose. The second dataset is from a major life insurance company, for an RRSP (Registered Retirement Saving Plan) campaign. It has 80,000 customers in total, with an initial 7% buyer. Each customer is described by 10 numerical and discrete attributes. The third dataset is from a company which runs a particular “bonus program". The company has over 100 sponsors (retail stores, gas stations, etc.), and when customers buy certain products from the partners, a certain amount of bonus is accumulated, which can be redeemed for other services and products (from free flights to free movies). Naturally, the company is gathering a huge amount of information on what customers are buying every day. The company then mails commercial flyers to customers along with quarterly summary statements, and it wants to send the right set of commercial