A better article selection procedure is needed. In the current
paper, we used a simple manual process, yielding at most a few
dozen candidate articles, in order to establish feasibility.
However, practical techniques should use a comprehensive process
that evaluates thousands, millions, or all plausible articles for
inclusion in the model. Such a process would also facilitate content analysis
studies that evaluate which types of articles are predictive of
disease incidence.
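
As a rough illustration of what a comprehensive selection step might look like, the sketch below scores every candidate article by the absolute Pearson correlation between its traffic series and the incidence series, then keeps the top-ranked ones. This is only one possible screening criterion, not a procedure described in this paper; the names `pearson`, `rank_articles`, `article_views`, and `top_k` are hypothetical, and the sketch assumes that all series are already aligned to the same time intervals.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("series must have equal length of at least 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        # A constant series carries no signal; score it as uncorrelated.
        return 0.0
    return sxy / sqrt(sxx * syy)


def rank_articles(article_views, incidence, top_k=10):
    """Rank candidate articles by |correlation| with incidence.

    article_views: hypothetical dict mapping article title -> traffic series.
    incidence: official incidence series over the same time intervals.
    Returns up to top_k (title, correlation) pairs, strongest first.
    """
    scored = [
        (title, pearson(views, incidence))
        for title, views in article_views.items()
    ]
    scored.sort(key=lambda pair: abs(pair[1]), reverse=True)
    return scored[:top_k]


# Toy data for illustration only; these are not real measurements.
candidates = {
    "Article A": [1, 2, 3, 4],
    "Article B": [4, 1, 3, 2],
    "Article C": [5, 5, 5, 5],
}
incidence = [10, 20, 30, 40]
ranking = rank_articles(candidates, incidence, top_k=2)
```

Because the loop touches each article once, the same approach scales to very large candidate sets. Screening many candidates against a single incidence series will, however, surface some spurious correlations by chance, so any selection made this way would need to be checked on held-out data.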