The task of organising the knowledge contained in
such resources to facilitate their effective retrieval
would be gigantic if it were carried out manually.
The approach to classification therefore has to be
machine-based. The most elementary approach,
also employed by many of the Web search engines,
would be to consider every word (every keyword)
in a document as defining a class to which the
document belongs and build a huge term-document
matrix as below: