7 CONCLUSION

We propose ScaleGPU, a novel GPU architecture that enables high-performance, memory-unaware GPU programming. To do so, ScaleGPU uses GPU memory as a cache of CPU memory. It achieves a significant performance improvement while also significantly reducing the memory requirement.