作者: Olivier Janssens , Steven Verstockt , Erik Mannens , Sofie Van Hoecke , Rik Van de Walle
DOI: 10.1007/978-3-319-13817-6_12
关键词: Statistical classification 、 Data mining 、 Emotion recognition 、 Crowdsourcing 、 Feature engineering 、 Algorithm design 、 Annotation 、 Natural language processing 、 Artificial intelligence 、 Computer science 、 Set (abstract data type)
摘要: Research on emotion recognition of tweets focuses feature engineering or algorithm design, while dataset labels are barely questioned. Datasets often labelled manually via crowdsourcing, which results in strong labels. These methods time intensive and can be expensive. Alternatively, tweet hashtags used as free, inexpensive weak This paper investigates the impact using compared to The study uses two label sets for a corpus tweets. weakly annotated set is created employing tweets, by use crowdsourcing. Both separately input five classification algorithms determine performance indicate only 9.25% decrease f1-score when does not outweigh benefits having free