Accurate network distance prediction between information collection node and websites distributed in
different locations is the basis for collaborative information collection, and also localize the process of
collecting sites, thereby reduce the network distance overhead, improve collection efficiency, reduce
network load, enhance fault-tolerance capability[1], which is of great significance.