such data can be highly structured (data from relational
databases), semi-structured (web logs, social media feeds, raw
feeds taken directly from sensors, email, etc.) or unstructured
(video, still images, audio, clicks) [12]. Another “V”, for Variability,
can be added to variety to emphasize semantics,
or the variability of meaning in language and communication
protocols.