In the case of our study, the tweets are user-generated content on social media. A small number of common student problems appear in high frequency, and a large number of less common problems or noisy tweets each appear in very low frequency. This indicates a “long tail” character. It is a challenge and also our future work to reveal more insightful information from this long tail.