where xj is the count of some item, J is the total number of
possible items (i.e., vocabulary size), M is the estimated metric value, and aj are selected by linear regression or similar methods. When appropriately trained, these methods can be quite accurate;
for example, many of the cited models can produce near real-time
estimates of case counts with correlations upwards of r = 0.95.