Emerging directions in predictive text mining

作者: Nitin Indurkhya

DOI: 10.1002/WIDM.1154

关键词:

摘要: In recent years, Text Mining has seen a tremendous spurt of growth as data scientists focus their attention on analyzing unstructured data. The main drivers for this have been big well complex applications where the information in text is often combined with other kinds building predictive models. These require highly efficient and scalable algorithms to meet overall performance demands. context, six directions are identified research mining heading: Deep Learning, Topic Models, Graphical Modeling, Summarization, Sentiment Analysis, Learning from Unlabeled Text. Each direction its own motivations goals. There some overlap concepts because common themes prediction. models involved typically ones that involve meta-information or tags could be added text. can then used processing tasks such extraction. While boundary between fields Natural Language Processing becoming increasingly blurry, importance various involving means there still substantial potential within traditional sub-fields mining. data-centric also likely influence future Processing, especially resource-poor languages multilingual texts. WIREs Data Knowl Discov 2015, 5:155-164. doi: 10.1002/widm.1154

参考文章(78)
Yoshua Bengio, Xavier Glorot, Antoine Bordes, Antoine Bordes, Domain Adaptation for Large-Scale Sentiment Classification: A Deep Learning Approach international conference on machine learning. pp. 513- 520 ,(2011)
Richard Socher, Andrew Y. Ng, Eric H. Huang, Christopher D. Manning, Jeffrey Pennington, Semi-Supervised Recursive Autoencoders for Predicting Sentiment Distributions empirical methods in natural language processing. pp. 151- 161 ,(2011)
Natasa Milic-Frayling, Marko Grobelnik, Jure Leskovec, Impact of linguistic analysis on the semantic graph coverage and learning of document extracts national conference on artificial intelligence. pp. 1069- 1074 ,(2005)
Xia Hu, Jiliang Tang, Huiji Gao, Huan Liu, Unsupervised sentiment analysis with emotional signals Proceedings of the 22nd international conference on World Wide Web - WWW '13. pp. 607- 618 ,(2013) , 10.1145/2488388.2488442
Emmanouil Antonios Platanios, Avrim Blum, Tom Mitchell, None, Estimating accuracy from unlabeled data uncertainty in artificial intelligence. ,vol. 2014, pp. 682- 691 ,(2014)
B. Ravindran, M. Saravanan, S. Raman, Improving Legal Document Summarization Using Graphical Models international conference on legal knowledge and information systems. pp. 51- 60 ,(2006)
David E. Rumelhart, Geoffrey E. Hinton, Ronald J. Williams, Learning representations by back-propagating errors Nature. ,vol. 323, pp. 696- 699 ,(1988) , 10.1038/323533A0