Based on (12), label-based accuracy accounts for true negative (tn) numbers. Therefore, under the situation that there are a large number of tweets in “others”, accuracy measures are much higher than F1 scores in Table 3. Label-based accuracy is not a very effective measure to account label imbalance here, so we do not use this measure in further discussion.