Step 2: BoVW vocabulary development: In a typical Bo VW
framework the interest points from a set of images ae first
detected and represented using a descriptor [17]. Then an
unsupervised grouping is performed over the entire set of
descriptors to generate k-clusters. Each cluster is called a
visual word and the code book of visual words represents the
system vocabulary.