The high leakage power of the LLC comes from its large size, and its size comes
from conservative design-time choices that aim to accommodate most applications’
memory footprints. However, not all workloads running on CMPs need the entire
cache during their execution. Figure 2(a) illustrates the variable sensitivity of workloads
to changes in LLC capacity on a 16-core system. On the x-axis are multiprogrammed
workloads composed of benchmarks with different demands on capacity (see
Section 6 for workloads and simulation details). For example, workloads LL1 and
LL2 do not benefit from a larger capacity, while the performance of TH1 and TH2
improves significantly when a larger LLC is employed. Further, the required cache
size may also vary across program phases, as shown in Figure 2(b). When
the required cache size is smaller, parts of the LLC can be disabled to reduce
leakage power. In Figure 2(a), for example, if a 5% performance degradation is acceptable,
more than half of the LLC can be disabled to save power in all but two
workloads.
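
The selection implied by this 5% criterion can be sketched as follows. This is only an illustrative sketch, not the mechanism proposed in this work: the function name, the percentage-of-LLC capacity steps, and the normalized performance values are all hypothetical and are not taken from Figure 2. It assumes that performance does not decrease as capacity grows.

```python
def min_capacity_within_budget(perf_by_capacity, max_degradation=0.05):
    """Return the smallest enabled capacity whose performance stays within
    `max_degradation` of the performance obtained with the full LLC.

    `perf_by_capacity` maps an enabled capacity (here, percent of the LLC)
    to a performance figure where higher is better. Hypothetical helper.
    """
    full = max(perf_by_capacity)              # largest capacity = full LLC
    threshold = perf_by_capacity[full] * (1.0 - max_degradation)
    for capacity in sorted(perf_by_capacity): # try the smallest sizes first
        if perf_by_capacity[capacity] >= threshold:
            return capacity
    return full


# Hypothetical normalized performance profiles (NOT measured data).
insensitive = {25: 0.99, 50: 1.00, 75: 1.00, 100: 1.00}
sensitive = {25: 0.70, 50: 0.85, 75: 0.93, 100: 1.00}

small = min_capacity_within_budget(insensitive)  # a quarter of the LLC suffices
large = min_capacity_within_budget(sensitive)    # the full LLC is needed
```

Under these made-up profiles, the capacity-insensitive workload could run with three quarters of the LLC disabled, while the capacity-sensitive one would keep the whole cache enabled.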