CONTENT ANALYZER – When information has no structure (e.g. text), a
pre-processing step is needed to extract structured, relevant information.
The main responsibility of this component is to represent the content of items
(e.g. documents, Web pages, news, product descriptions) coming from
information sources in a form suitable for the next processing steps. Data items are
analyzed by feature extraction techniques in order to shift item representation
from the original information space to the target one (e.g. Web pages represented
as keyword vectors). This representation is the input to the PROFILE LEARNER
and FILTERING COMPONENT;
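
As a minimal sketch of the shift from unstructured text to a keyword vector, the Python fragment below tokenizes an item, drops a few stop words, and counts term frequencies. The function name analyze_content, the tiny stop-word list, and the sample sentence are hypothetical and chosen only for illustration; an actual content analyzer would typically add steps such as stemming and term weighting.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list (illustration only).
STOP_WORDS = {"a", "an", "and", "the", "of", "to", "in", "is", "for", "on"}


def analyze_content(text):
    """Map unstructured text to a keyword vector (term -> raw frequency)."""
    # Lowercase the text and keep alphanumeric runs as tokens.
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    # Discard stop words and count the remaining terms.
    return Counter(t for t in tokens if t not in STOP_WORDS)


# Example item: a short piece of text standing in for a Web page.
page = "The recommender analyzes the content of Web pages and news."
vector = analyze_content(page)
```

A mapping of this kind is one possible form of the structured representation that would then be passed on to the PROFILE LEARNER and FILTERING COMPONENT.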