作者: Suman Kalyan Maity , Ritvik Saraf , Animesh Mukherjee
关键词: Artificial intelligence 、 Baseline (configuration management) 、 Compounding 、 Computer science 、 Early prediction 、 Natural language processing 、 Natural language
摘要: Compounding of natural language units is a very common phenomena. In this paper, we show, for the first time, that Twitter hashtags which, could be considered as correlates such linguistic units, undergo compounding. We identify reasons compounding and propose prediction model can with 77.07% accuracy if pair in near future (i.e., 2 months after compounding) shall become popular. At longer times T = 6, 10 accuracies are 77.52% 79.13% respectively. This technique has strong implications to trending hashtag recommendation since newly formed compounds recommended early, even before taken place. Further, humans predict an overall only 48.7% (treated baseline). Notably, while discriminate relatively easier cases, automatic framework successful classifying harder cases.