ISSN Print: 2381-1196  ISSN Online: 2381-120X
International Journal of Investment Management and Financial Innovations  
Manuscript Information
Financial Sentiment Analysis Using Machine Learning Techniques
International Journal of Investment Management and Financial Innovations
Vol.3 , No. 1, Publication Date: Aug. 23, 2017, Page: 1-9
516 Views Since August 23, 2017, 554 Downloads Since Aug. 23, 2017

Sarkis Agaian, Department of Mathematics, New York University Courant Institute, New York, USA.


Petter Kolm, Department of Mathematics, New York University Courant Institute, New York, USA.


The rise of web content has presented a great opportunity to extract indicators of investor moods directly from news and social media. Gauging this sentiment or general prevailing attitude of investors may simplify the analysis of large, unstructured textual datasets and help anticipate price developments in the market. There are several challenges in developing a scalable and effective framework for financial sentiment analysis, including: identifying useful information content, representing unstructured text in a structured format under a scalable framework, and quantifying this structured sentiment data. To address these questions, a corpus of positive and negative financial news is introduced. Various supervised machine learning algorithms are applied to gage article sentiment and empirically evaluate the performance of the proposed framework on introduced media content.


Financial Sentiment, Sentiment Analysis, Text Categorization, Text Classification


X. Hu, J. Tang and H. Liu, "Unsupervised Sentiment Analysis with Emotional Signals," in International World Wide Web Conference Committee, Rio de Janeiro, 2013.


B. Pang and L. Lee, "Opinion mining and sentiment analysis," Foundations and Trends in Information Retrieval, vol. I, no. 2, pp. 1-135, 2008.


B. Connor, R. Balasubramanyan, B. Routledge and N. Smith, "From tweets to polls: Linking text sentiment to public opinion time series," in Proceedings of ICWSM, 2010.


M. Hu and B. Liu, "Mining and summarizing customer reviews," in ACM SIGKDD international conference on Knowledge discovery and data mining, New York, 2004.


X. Hu, N. Sun, C. Zhang and T. Chua, "Exploiting internal and external semantics for the clustering of short texts using world knowledge," in Proceedings of CIKM, 2009.


B. Liu, Handbook of Natural Language Processing, Boca Raton: CRC Press, Taylor and Francis Group, 2010.


B. Pang, L. Lee and S. Vaithyanathan, "Thumbs up?: sentiment classification using machine learning techniques," in Proceedings of ACL, 2002.


J. Wiebe, T. Wilson and C. Cardie, "Annotating expressions of opinions and emotions in language," Language Resources and Evaluation, vol. 39, no. 165, pp. 165-210, 2005.


T. Wilson, J. Wiebe and P. Hoffmann, "Recognizing contextual polarity in phrase-level sentiment analysis," in Proceedings of HLT and EMNLP, 2005.


P. Turney, "Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews," in Proceedings of the Association for Computational Linguistics, 2002.


K. Dave, "Mining the Peanut Gallery: Opinion Extraction and Semantic Classification of Product Reviews," in WWW2003, 2004.


P. Tetlock, M. Saar-Tsechansky and S. Macskassy, "More Than Words: Quantifying Language to Measure Firms," Journal of Finance, vol. 68, pp. 1437-1467, 2008.


P. Tetlock, "Giving Content to Investor Sentiment: The Role of Media in the Stock Market," Journal of Finance, vol. 62, no. 3, pp. 1139-1168, 2007.


P. Azar, "Sentiment Analysis in Financial News," Harvard College (Thesis), Cambridge, Massachusetts, 2009.


S. Argamon-Engelson, M. Koppel and G. Avneri, "Style-based Text Categorization: What Newspaper Am I Reading?," AAAI, 1998.


B. Kessler, G. Nunberg and H. Schautze, "Automatic Detection of Text Genre," in ACL, 1997.


E. Spertus, "Smokey: Automatic recognition of hostile messages," in Proceedings of Innovative Applications of Artificial Intelligence, 1997.


E. Fama, "Random Walks in Stock Market Prices," Financial Analysts Journal, vol. 21, no. 5, pp. 55-59, 1965.


V. Hatzivassiloglou and K. McKeown, "Predicting the Semantic Orientation of Adjectives," in Proceedings of ACL, 1997.


V. Hatzivassiloglou and J. Wiebe, "Effects of Adjective Orientation and Gradability on Sentence Subjectivity," in Proceedings of International Conference on Computational Linguistics, 2000.


P. Turney and M. Littman, "Unsupervised Learning of Semantic Orientation from a Hundred-Billion-Word Corpus," 2002.


P. Chaovalit and L. Zhou, "Movie Review Mining: a Comparison between Supervised and Unsupervised Classification Approaches," Proceedings of Annual Hawaii International Conference on System Sciences, 2005.


A. Finn, N. Kushmerick and B. Smyth, "Genre Classification and Domain Transfer for Information Filtering," in Proceedings of European Colloquium on Information Retrieval Research, 2002.


S. Durbin, D. Warner, J. Richter and Z. Gedeon, "Information Self-Service with a Knowledge Base That Learns," AI Magazine, vol. 23, no. 4, pp. 41-50, 2002.


M. Efron, "Cultural orientations: Classifying subjective documents by cocitation analysis," in Proceedings of the AAAI Fall Symposium Series on Style and Meaning in Language, Art, Music, 2004.


D. Inkpen, O. Feiguina and G. Hirst, "Generating more-postive and more-negative text," in Computing Attitude and Affect in Text: Theory and Applications, Dordrecht, The Netherlands, Springer, 2005, pp. 187-196.


M. Gamon, "Sentiment classification on customer feedback data noisy data, large feature vectors, and the role of linguistic analysis," in Proceedings of the 20th international conference on Computational Linguistics, 2004.


J. Wiebe and E. Riloff, " Creating Subjective and Objective Sentence Classifiers from Unannotated Texts," in Computational Linguistics and Intelligent Text Processing, 2005.


T. Wilson., J. Wiebe. and P. Hoffmann, "Recognizing contextual polarity in phrase-level sentiment analysis," Computational Linguistics, vol. 35, no. 3, pp. 399-433, 2009.


Y. Dang, Z. Yulei and H. Chen, "A lexicon enhanced method for sentiment classification: An experiment on online product reviews," IEEE Intelligent Systems, vol. 25, no. 4, pp. 46-53, 2010.


B. Liu, Sentiment Analysis and Opinion Mining, Claypool Publishers, 2012.


P. Tetlock, M. Saar-Tsechansky and S. Macskassy, "More than words: Quantifying Language to Measure Firms' Fundamentals," Journal of Finance, vol. 63, no. 3, pp. 1437-1467, 2008.


G. Mishne, "Prediciting Movie Sales from Blogger Sentiment," in Computational Approaches to Analysing Weblogs, 2006.


J. Nofsinger, "Social Mood and Financial Economics," Journal of Behavioral Finance, vol. 6, no. 3, pp. 144-160, 2005.


E. Gilbert and K. Karahalios, "Widespread Worry and the Stock Market," in Proceedings of the International, 2010.


I. Bordino, S. Battiston, G. Caldarelli, M. Cristelli, A. Ukkonen and I. Weber, "Web Search Queries Can Predict Stock Market Volumes," PLoS ONE, vol. 7, no. 7, p. e40014, 2012.


M. Thelwall, K. Buckley, G. Paltoglou, D. Cai and A. Kappas, "Sentiment strength detection in short informal text," Journal of the American Society for Information Science and Technology, vol. 61, no. 12, pp. 2544-2558, 2010.


M. Nofer, Using Twitter to Predict the Stock Market: Where is the Mood Effect?, New York: Springer, 2015.


J. Bollen, H. Mao and X. Zeng, "Twitter mood predicts the stock market," Journal of Computational Science, vol. 2, no. 1, pp. 1-8, 2011.


E. J. Ruiz, V. Hristidis, C. Castillo, A. Gionis and A. Jaimes, Correlating financial time series with micro-blogging activity, ACM Press, 2012.


J. Smailovic, M. Grcar and M. Znidaršic, "Sentiment analysis on tweets in a financial domain," in International Postgraduate School Students Conference, 2012.


Y. Lu, M. Castellanos, U. Dayal and C. Zhai, "Automatic construction of a context-aware sentiment lexicon: an optimization," in Proceedings of WWW, 2011.


R. Feldman, "Techniques and Applications for Sentiment Analysis," Communications of the ACM, vol. 56, no. 4, pp. 82-89, 2013.


P. Domingos and M. Pazzani, "On the optimality of the simple Bayesian classifier under zero-one loss," Machine Learning, vol. 29, pp. 103-130, 1997.


T. Joachims, "Text categorization with support vector machines: Learning with many relevant features," in Proceedings of the Tenth European Conference on Machine Learning, Berlin, 1998.


T. Joachims, "Estimating the generalization performance of a SVM efficiently," LS VIII-Report, Universitat Dortmund, Germany, 1999.


B. Schlkopf and A. J. Smola, Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond, MIT Press, 2002.


V. Vapnik, The Nature of Statistical Learning Theory, Springer-Verlag, 1995.


M. Nardo, M. Petracco and M. Naltsidis, "Walking Down Wall Street with a Tablet: A Survey of Stock Market Predictions Using the Web," Journal of Economic Surveys, vol. 30, no. 2, pp. 3556-369, 2016.


B. Agarwal and N. Mitta, " Machine Learning Approach for Sentiment Analysis," in Prominent Feature Extraction for Sentiment Analysis, Springer, 2015, pp. 21-45.


M.-Y. Day and C.-C. Lee, "Deep learning for financial sentiment analysis on finance news providers," in IEEE/ACM International Conference, 2016.


S Das and A. Das, "Fusion with sentiment scores for market research," in Information Fusion International Conference, 2016.


A. Akansu, S. Kulkarni and D. Malioutov, Financial Signal Processing and Machine Learning, John Wiley & Sons, 2016.


D. D. Wu and D. L. Olson, "Financial Risk Forecast Using Machine Learning and Sentiment Analysis," in Enterprise Risk Management in Finance, Palgrave Macmillan, 2015, pp. 32-48.

  Join Us
  Join as Reviewer
  Join Editorial Board