Models should afford review, replication,
improvement, and deployment by third parties. This guarantees a
high-quality scientific basis, continuity of operations, and broad
applicability. These requirements imply that model algorithms —
in the form of source code, not research papers — must be
generally available, and they also imply that complete input data
must be available. The latter is the key obstacle, as terms are
dictated by the data owner rather than the data user; this
motivated our exploration of Wikipedia access logs. To our
knowledge, no models exist that use both open data and open
algorithms.