Our results show that the tradeoffs on this new platform are quite different from those found in a parallel RDBMS, because MapReduce makes deliberate design choices that sacrifice performance for scalability. A notable trend is the development of declarative query languages [10, 11, 15, 24] that sit on top of MapReduce. Our findings provide an important first step toward query optimization in these languages.