can be diverse for a variety of reasons. A good partition
method should take this into consideration instead of
always dividing the work evenly among all reducers.
In this paper, we present a new strategy called LIBRA
(Lightweight Implementation of Balanced Range Assignment) to solve the data skew problem for reduce-side applications in MapReduce. Compared to the previous work, our
contributions include the following