The main hardware costs of the GPU-CC architecture
are the configuration registers and FIFO buffers. Each of
the 32 cores has a configuration register and three 16 element
FIFOs. Each of the load-store units also has an instruction
cache, one 256 element and one 16 element FIFO.