where p is the number of kernels, N is the number
of hid-den layer nodes, d is the maximum distance
between each data point to the mean, xj is a sampled
data present at the jth node. ci is the kernel value
and wi is the weights linking between input nodes.