The heterogeneous multicore architecture we address in this work is depicted in Figure 2. This design is modelled after Intel Sandy Bridge [5]. The processor consists of several CPU and GPU cores each with its own private cache. These cores share the LLC and DRAM controllers, and the modules communicate through an on-chip interconnection network. Efficient sharing of on-chip resources is critical to the performance of a multicore processor. The last-level cache is one of the most important among these resources.