P(X) is constant for all classes.
Only the maximum P(X|Ci)P(Ci) needs to be calculated.
If no samples are given, then all classes are assumed as equally likely. Therefore,
P(X|Ci) would be maximized rather than maximizing
P(X|Ci)P(Ci) To determine P(Ci):