Organization. At this point the architecture has to deal with various data formats (texts formats, compressed files,
variously delimited, etc.) and must be able to parse them and extract the actual information like named entities, relation
between them, etc.