Speedups
Uniform (flat) kernel: much faster than a Gaussian kernel, but the results are usually not as good
Binning or hierarchical methods
Approximate nearest neighbor search
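To make the kernel trade-off concrete, here is a minimal sketch of a single mean shift trajectory with either a uniform or a Gaussian kernel. The function name `mean_shift_point` and its parameters (`bandwidth`, `kernel`, `max_iter`, `tol`) are illustrative assumptions, not taken from the slides or the cited paper. With the uniform kernel, each step is just the plain average of the points inside the window, which is why it is cheap and pairs well with binning or nearest neighbor search.

```python
import math

def mean_shift_point(x, points, bandwidth, kernel="uniform", max_iter=100, tol=1e-6):
    """Shift x toward a local density mode; returns the final position.

    Hypothetical helper for illustration. `points` is a list of
    equal-length tuples, `bandwidth` is the kernel size h.
    """
    x = list(x)
    for _ in range(max_iter):
        weight_sum = 0.0
        weighted = [0.0] * len(x)
        for p in points:
            d2 = sum((a - b) ** 2 for a, b in zip(x, p))
            if kernel == "uniform":
                # Flat kernel: weight 1 inside the window, 0 outside.
                w = 1.0 if d2 <= bandwidth ** 2 else 0.0
            else:
                # Gaussian kernel: every point contributes, so no early cut-off.
                w = math.exp(-0.5 * d2 / bandwidth ** 2)
            weight_sum += w
            for i, b in enumerate(p):
                weighted[i] += w * b
        if weight_sum == 0.0:
            break  # empty window: nothing to shift toward
        new_x = [v / weight_sum for v in weighted]
        shift = math.sqrt(sum((a - b) ** 2 for a, b in zip(new_x, x)))
        x = new_x
        if shift < tol:
            break
    return x
```

Calling `mean_shift_point((0.2, 0.2), points, bandwidth=1.0)` on data with a tight cluster near the origin moves the start point to that cluster's mean; passing `kernel="gaussian"` gives a smoother weighting at the cost of touching every point on every step.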
Methods to adapt kernel size depending on data density
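One common heuristic for adapting kernel size, assumed here for illustration rather than taken from the slides, is to give each data point a bandwidth equal to the distance to its k-th nearest neighbor: dense regions get small kernels, sparse regions get large ones. The helper name `knn_bandwidths` is hypothetical, and the brute-force search is only meant for small examples.

```python
import math

def knn_bandwidths(points, k):
    """Return one bandwidth per point: the distance to its k-th nearest neighbor.

    Hypothetical helper; brute force O(n^2 log n), fine for a small sketch.
    """
    bandwidths = []
    for i, p in enumerate(points):
        # Distances from p to every other point, smallest first.
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        bandwidths.append(dists[k - 1])
    return bandwidths
```

For the 1-D points 0, 1, 2 and 10 with `k=2`, the isolated point at 10 receives a much larger bandwidth than the three clustered points.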
Strong theoretical support: each step moves along the gradient of a kernel density estimate
D. Comaniciu and P. Meer, Mean Shift: A Robust Approach toward Feature Space Analysis, PAMI 2002.