Sample Applications Using See5/C5.0
Profiling High Income Earners from Census Data
Assessing Churn Risk
Detecting Advertisements on the Web
Identifying Spam
Diagnosing Hypothyroidism
Now Read On ...
This page should give you a feel for the kinds of results achievable with See5 (the Windows version) and C5.0 (the Linux version).
So what makes See5/C5.0 different? One short answer is its attention to the issue of comprehensibility. At RuleQuest, we believe that a data mining system should find patterns that provide insight in addition to supporting accurate predictions. In line with this approach, See5/C5.0 emphasizes rule-based classifiers because they are easier to understand -- each rule can be examined and validated separately, without having to consider it in the context of the classifier as a whole.
You'll also notice that See5/C5.0 is fast -- each of these examples requires at most a few seconds. (The times shown here are for C5.0 on a 3.4GHz Intel Core i7 PC running CentOS 6.) See5/C5.0 can also generate decision trees, useful in situations where classifiers must be constructed even more quickly.
Without more ado, let's jump straight in ...
Profiling High Income Earner