Sentiment Analysis for Financial Applications: Combining Machine Learning, Computational Linguistics, and Statistical Methods for Predicting Stock Price Behavior

作者: Lars Smørås Høysæter , Pål-Christian S Njølstad

DOI:

关键词:

摘要: In this thesis we use sentiment analysis, a classification task within the field of artificial intelligence, for financial applications. Hereunder, combine machine learning, computational linguistics, and statistical methods anticipating stock price behavior ten shares listed on Oslo Stock Exchange (OSE). These predictions have been made basis classifications firm-specific news articles, output by our specially constructed engine, an aggregated market-wide index. The motivation approach comes from being most felicitous source information; in effect widely-read filtering aggregating funnel sentiments. Furthermore, OSE has selected, firstly, its faculty inefficient, compared to peer marketplaces, and, secondly, inherent barriers processing Norwegian language associated with exchange, having meagre linguistic resources. If able surmount these exploit predictive value sentiments, one could potentially attain competitive advantage trading market. constructing named found contextual features be paramount precision addition developed optimized parsimonious lexica construction. Despite lack resources, achieve state-of-the-art using manual annotation. engine make statistically significant return, volume, order size. Positive predominantly, lead increases volume while negative articles predict opposite effect. same is general proclivity For only impact future behavior, ceteris paribus, depreciating subsequent prices. interaction between also significant. Although sign latter seems firm-idiosyncratic, analysis reveal that illiquid stocks exhibit stronger reactions than liquid stocks.

参考文章(13)
Steven Skiena, Wenbin Zhang, Trading Strategies to Exploit Blog and News Sentiment international conference on weblogs and social media. ,(2010)
Jeffrey M. Wooldridge, Introductory Econometrics: A Modern Approach ,(1999)
Y Zhai, Hsu, A Hsu, SK Halgamuge, Combining News and Technical Indicators in Daily Stock Price Trends Prediction international symposium on neural networks. pp. 1087- 1096 ,(2007) , 10.1007/978-3-540-72395-0_132
Songbo Tan, Yuefen Wang, Xueqi Cheng, Combining learn-based and lexicon-based techniques for sentiment detection without using labeled examples Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '08. pp. 743- 744 ,(2008) , 10.1145/1390334.1390481
Theresa Wilson, Janyce Wiebe, Paul Hoffmann, Recognizing contextual polarity in phrase-level sentiment analysis Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing - HLT '05. pp. 347- 354 ,(2005) , 10.3115/1220575.1220619
John S. Uebersax, Diversity of decision-making models and the measurement of interrater agreement. Psychological Bulletin. ,vol. 101, pp. 140- 146 ,(1987) , 10.1037/0033-2909.101.1.140
J. Yi, T. Nasukawa, R. Bunescu, W. Niblack, Sentiment analyzer: extracting sentiments about a given topic using natural language processing techniques international conference on data mining. pp. 427- 434 ,(2003) , 10.1109/ICDM.2003.1250949
Jon Atle Gulla, Wei Wei, Sentiment Learning on Product Reviews via Sentiment Ontology Tree meeting of the association for computational linguistics. ,vol. 2010, pp. 404- 413 ,(2010)
J.R. Quinlan, Induction of Decision Trees Machine Learning. ,vol. 1, pp. 81- 106 ,(1986) , 10.1023/A:1022643204877