However,current data mining toolkits cannot meet the requirement of applications with
large-scale databases in terms of speed. In this paper, we propose three techniques to speedup fundamental problems in data mining algorithms on the CUDA platform:
scalable thread scheduling scheme for irregular pattern, parallel distributed top-k
scheme, and parallel high dimension reduction scheme.