Cloud computing is a powerful technology to perform massive-scale and complex
computing. It eliminates the need to maintain expensive computing hardware, dedicated
space, and software. Massive growth in the scale of data or big data generated through
cloud computing has been observed. Addressing big data is a challenging and timedemanding
task that requires a large computational infrastructure to ensure successful
data processing and analysis. The rise of big data in cloud computing is reviewed in this
study. The definition, characteristics, and classification of big data along with some
discussions on cloud computing are introduced. The relationship between big data and
cloud computing, big data storage systems, and Hadoop technology are also discussed.
Furthermore, research challenges are investigated, with focus on scalability, availability,
data integrity, data transformation, data quality, data heterogeneity, privacy, legal and
regulatory issues, and governance. Lastly, open research issues that require substantial
research efforts are summarized.