The rest of the paper is organized as follows. Section 2
provides a background on MapReduce and the causes of
data skew. Section 3 describes the implementation of our
LIBRA system and Section 4 presents its algorithm details.
Performance evaluation is in Section 5. Section 6 discusses
related work. Section 7 concludes.