In order to identify (discriminate) different
domains within the documents found for each
entity, clustering techniques are used. In the
experiments, we use Lingo and STC, both part of
the Carrot2 framework (Carrot2, 2009). Even
though the FVC algorithm is not designed for any
particular clustering algorithm, we need to test
whether a change of algorithm has any major
impact on the feature vector quality.