Information collection system requires crawlers and site nodes network distance to define and measure.
Confining websites link from crawlers to the internal cluster is the way to improve the localization ratio
of network flow.
In the crawler system, we are more concerned about the time of crawling pages, so network distance
in this paper is defined as the time of each crawler crawling each page. Information collection system
network distance is defined as follows