作者: Vineet Mahajan , Thu Kyaw , Sang Chui Song , Elena Haliczer
DOI:
关键词:
摘要: Computerized systems and methods are disclosed for performing contextual classification of objects using supervised unsupervised training. In accordance with one implementation, content reviewers may review training submit data preprocessing analysis. The be preprocessed to identify key terms phrases, such as by stemming, tokenization, or n-gram analysis, form vectorized objects. used train more models subsequent certain implementations, training, among other steps, performed in parallel over multiple machines improve efficiency. a wide variety applications, article moderation.