• Data Integration Capability
– Apache Sqoop: a tool designed for transferring data from
a relational database directly into HDFS or into Hive
[12,18]. After analyzing the tables in the database schema,
it automatically generates the classes needed to import
the data into HDFS; the tables' contents are then read
by a parallel MapReduce job;
– Apache Flume: a distributed, reliable, and available service
for efficiently collecting, aggregating, and moving large
amounts of log data. It is designed to import streaming
data flows [12,27].
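
To illustrate what "parallel" means in the Sqoop item above, the following minimal sketch shows one way a table read could be divided among several workers: the range of a numeric key column is cut into contiguous sub-ranges, and each sub-range becomes one bounded query. This is an illustration under stated assumptions, not Sqoop's actual implementation. The function names `split_key_range` and `range_query`, the table name `orders`, and the column name `id` are all hypothetical.

```python
from typing import List, Tuple


def split_key_range(min_key: int, max_key: int, num_workers: int) -> List[Tuple[int, int]]:
    """Cut the inclusive range [min_key, max_key] into contiguous inclusive
    sub-ranges, one per worker (fewer if the range has fewer keys than workers).

    Hypothetical helper for illustration; not part of any Sqoop API.
    """
    if num_workers < 1:
        raise ValueError("num_workers must be at least 1")
    if max_key < min_key:
        return []

    total = max_key - min_key + 1
    parts = min(num_workers, total)
    base, extra = divmod(total, parts)

    ranges = []
    start = min_key
    for i in range(parts):
        # The first `extra` sub-ranges get one additional key each.
        size = base + (1 if i < extra else 0)
        end = start + size - 1
        ranges.append((start, end))
        start = end + 1
    return ranges


def range_query(table: str, column: str, lo: int, hi: int) -> str:
    """Build the bounded query one worker would run (illustrative only)."""
    return f"SELECT * FROM {table} WHERE {column} >= {lo} AND {column} <= {hi}"


if __name__ == "__main__":
    # Assumed example: keys 1..10 read by 4 workers.
    for lo, hi in split_key_range(1, 10, 4):
        print(range_query("orders", "id", lo, hi))
```

Because the sub-ranges do not overlap and together cover the whole key range, each row is read exactly once, and the workers need no coordination with one another while reading.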