This results in underutilized compute nodes and, consequently, inefficient recomputation. The second challenge is that hot-spots appear during the recomputation of a job’s mappers. In the initial run of the job, mapper accesses to input are essentially balanced over all nodes, so the number of concurrent accesses on any one node is limited. During recomputation, by contrast, we find that mapper accesses can concentrate concurrently on one or a few storage locations.
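The contrast between the two access patterns can be illustrated with a minimal sketch. It is not taken from any measured system: the node count, the mapper counts, the round-robin placement, and the choice of node 3 as the single storage location are all hypothetical values chosen only to make the effect visible.

```python
from collections import Counter

# Hypothetical cluster size and mapper counts (illustrative assumptions only).
NUM_NODES = 10
NUM_MAPPERS_INITIAL = 40
NUM_MAPPERS_RECOMPUTE = 8


def max_concurrent_accesses(input_locations):
    """Return the largest number of mappers reading from any single node,
    assuming all listed mappers run at the same time."""
    return max(Counter(input_locations).values())


# Initial run: assume input blocks are spread round-robin over all nodes,
# so each mapper reads from a different node in turn.
initial_run = [mapper % NUM_NODES for mapper in range(NUM_MAPPERS_INITIAL)]

# Recomputation: assume every recomputed mapper reads its input from the
# same storage location (node 3 here, an arbitrary choice).
recomputation = [3] * NUM_MAPPERS_RECOMPUTE

initial_peak = max_concurrent_accesses(initial_run)
recompute_peak = max_concurrent_accesses(recomputation)
```

Under these assumptions, the recomputation issues far fewer accesses in total, yet its peak load on a single node is higher than in the initial run, which is the hot-spot effect described above.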