In that case we need to process complete data set for almost every query on the data to get the desired result. If we process this huge amount of data on a single machine in linear fashion then we face problems like data storage issues and response time. So we need a mechanism which can process huge data sets in parallel fashion on distributed machines.