We used two data sources, Wikipedia article access logs and
official disease incidence reports, and built linear models to
analyze approximately 3 years of data for each of 14 diseaselocation
contexts. This section details the nature, acquisition, and
processing of these data as well as how we computed the
estimation models and evaluated their output.