Change search
ReferencesLink to record
Permanent link

Direct link
Sentiment Analysis for Financial Applications: Combining Machine Learning, Computational Linguistics, and Statistical Methods for Predicting Stock Price Behavior
Norwegian University of Science and Technology, Faculty of Information Technology, Mathematics and Electrical Engineering, Department of Computer and Information Science.
Norwegian University of Science and Technology, Faculty of Information Technology, Mathematics and Electrical Engineering, Department of Computer and Information Science.
2014 (English)MasteroppgaveStudent thesis
Abstract [en]

In this thesis we use sentiment analysis, a classification task within the field of artificial intelligence, for financial applications. Hereunder, we combine machine learning, computational linguistics, and statistical methods for anticipating stock price behavior of ten shares listed on the Oslo Stock Exchange (OSE). These predictions have been made on the basis of sentiment classifications of firm-specific news articles, output by our specially constructed sentiment engine, and an aggregated market-wide sentiment index. The motivation for this approach comes from news being a most felicitous source of financial information; in effect a widely-read filtering and aggregating funnel of sentiments. Furthermore, the OSE has been selected, firstly, for its faculty of being inefficient, compared to peer marketplaces, and, secondly, for the inherent barriers to processing the Norwegian language associated with the exchange, having meagre linguistic resources. If able to surmount these barriers and exploit the predictive value of news sentiments, one could potentially attain a competitive advantage trading in this market. In constructing the named sentiment engine, we have found contextual features to be paramount in classification precision in addition to having developed and optimized a parsimonious approach to sentiment lexica construction. Despite the lack of linguistic resources, we achieve state-of-the-art classification precision in this approach using manual annotation. The engine has been found to make statistically significant predictions on stock return, volume, and order size. Positive articles, predominantly, lead to significant increases in volume while negative articles predict the opposite effect. The same is the general proclivity for order size. For return, only negative articles impact future stock price behavior, ceteris paribus, depreciating subsequent stock prices. The interaction between news articles and market-wide sentiment is also statistically significant. Although the sign of this latter effect seems firm-idiosyncratic, our analysis reveal that illiquid stocks exhibit stronger reactions than liquid stocks.

Place, publisher, year, edition, pages
Institutt for datateknikk og informasjonsvitenskap , 2014. , 187 p.
URN: urn:nbn:no:ntnu:diva-26816Local ID: ntnudaim:10967OAI: diva2:751724
Available from: 2014-10-01 Created: 2014-10-01 Last updated: 2014-10-01Bibliographically approved

Open Access in DiVA

fulltext(6808 kB)958 downloads
File information
File name FULLTEXT01.pdfFile size 6808 kBChecksum SHA-512
Type fulltextMimetype application/pdf
cover(185 kB)12 downloads
File information
File name COVER01.pdfFile size 185 kBChecksum SHA-512
Type coverMimetype application/pdf

By organisation
Department of Computer and Information Science

Search outside of DiVA

GoogleGoogle Scholar
Total: 958 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 281 hits
ReferencesLink to record
Permanent link

Direct link