Side note: Conditional independence of events does in fact not hold
– “New”/ “York“ and “Hong” / “Kong"
– Still, good accuracy can be achieved
Boolean representation simplistic
– Keyword counts lost
More elaborate probabilistic methods
– E.g. estimate probability of term v occurring in a document of class C by
relative frequency of v in all documents of the class
Other linear classification algorithms (machine learning) can be used
– Support Vector Machines, ..