a general one with frequent words from the aforementioned training datasets, and two additional lists with frequent words from two collections of tweets related to the stock market and to films. To evaluate the resulting classifiers, we collected two additional independent datasets, one English and one multilingual, containing 50,000 and 200,000 tweets respectively. Only the best-scoring classifiers are listed in Table IV and considered throughout the rest of this article.
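As an illustration of how a frequent-word list of this kind might be derived from a collection of tweets, the following is a minimal sketch and not the procedure actually used in this work. The function name `frequent_words`, the letter-only tokenizer, the lowercasing step, and the cut-off `top_n` are all assumptions introduced for the example.

```python
import re
from collections import Counter

# Hypothetical tokenizer: runs of Unicode letters only (no digits or underscores).
# URLs, hashtags and mentions are simply split into their letter runs here;
# a real pipeline would probably handle them explicitly.
TOKEN_RE = re.compile(r"[^\W\d_]+")


def frequent_words(tweets, top_n=1000):
    """Return the top_n most frequent lowercased words in an iterable of tweets."""
    counts = Counter()
    for text in tweets:
        counts.update(TOKEN_RE.findall(text.lower()))
    return [word for word, _ in counts.most_common(top_n)]


# Toy collection standing in for a domain-specific set of tweets.
sample = [
    "Stocks rally as the market opens",
    "The market closes lower",
    "the film opens today",
]
top = frequent_words(sample, top_n=3)
print(top)
```

Under these assumptions, one such list would be built per collection (general, stock market, films), with the value of `top_n` chosen empirically.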