Authors: Sencun Zhu, Zhi Xu
DOI:
Keywords:
Abstract: Offensive language has become a major issue for the health of both online communities and their users. For an online community, the spread of offensive language undermines its reputation, drives users away, and can even directly affect its growth. For users, viewing offensive language has a negative influence on their mental health, especially for children and youth. When offensive language is detected in a user message, a problem arises as to how it should be removed, i.e., the offensive language filtering problem. To solve this problem, manual filtering is known to produce the best result. However, manual filtering is costly in time and labor and thus cannot be widely applied. In this paper, we analyze the offensive language in text messages posted in online communities, and propose a new automatic sentence-level filtering approach that is able to semantically remove offensive language by utilizing the grammatical relations among words. Compared with existing automatic filtering approaches, the proposed approach provides filtering results much closer to manual filtering. To demonstrate our work, we created a dataset by manually filtering over 11,000 text comments from the YouTube website. Experiments on this dataset show 90% agreement in filtered results between the proposed approach and the manual filtering approach. Moreover, the overhead of applying the proposed approach to user comment filtering is reasonable, making it practical to adopt in real-life applications.