Between 17 May 2007, when n-gram collection started, and 31 January 2014, Hascheck
processed 6.83 million texts, which together form a corpus of 1.72 Gtokens. The average text length of approximately 250
tokens makes the system well suited to n-gram collection.
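
The average length quoted above follows directly from the two reported totals. As a quick sanity check, the arithmetic can be sketched as below. The constant and function names are illustrative, and the texts-per-day figure is derived here from the rounded totals rather than reported in the source, so it should be read as a rough estimate only.

```python
from datetime import date

# Totals as reported in the text (rounded figures).
TEXTS = 6.83e6            # texts processed
TOKENS = 1.72e9           # corpus size in tokens (1.72 Gtokens)
START = date(2007, 5, 17)  # start of n-gram collection
END = date(2014, 1, 31)    # end of the reported period


def average_text_length(tokens: float, texts: float) -> float:
    """Mean number of tokens per processed text."""
    return tokens / texts


def collection_days(start: date, end: date) -> int:
    """Number of days between the start and end dates."""
    return (end - start).days


def texts_per_day(texts: float, days: int) -> float:
    """Derived throughput; not a figure stated in the source."""
    return texts / days


if __name__ == "__main__":
    days = collection_days(START, END)
    print(f"average text length: {average_text_length(TOKENS, TEXTS):.1f} tokens")
    print(f"collection period:   {days} days")
    print(f"texts per day:       {texts_per_day(TEXTS, days):.0f} (derived estimate)")
```

The computed mean comes out at roughly 252 tokens per text, consistent with the "approximately 250" stated above.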