Data mining is widely used in various domains and has
significant applications. However, current data mining
tools cannot meet the requirement of applications
with large‐scale databases in terms of speed. We
propose three techniques to accelerate fundamental
kernels in data mining algorithms on CUDA platform,
scalable thread scheduling scheme for irregular
pattern, parallel distributed top‐k scheme, and parallel
high dimension reduction scheme. They play a key role
in our GUCAS_CU‐Miner, including three
representative data mining algorithms, CU‐Apriori, CUKNN
and CU‐K‐means.