The University of Southampton
University of Southampton Institutional Repository

Capturing investor sentiment from Big Data: the effects of online social media on SET50 index

Capturing investor sentiment from Big Data: the effects of online social media on SET50 index
Capturing investor sentiment from Big Data: the effects of online social media on SET50 index
This research aims to introduce a market sentiment index which can be used as a leading indicator for the Stock Exchange of Thailand (SET). This new index is constructed from Big Data extracted from online social media. The data used in this project are the daily firm-level and market data, market capitalizations of firms and the SET, the total trading value in the SET and the values of stocks bought and sold by investor types, the data on S&P global index, the implied volatility index of S&P500 (known as VIX) and the exchange rate of Thai Baht. Meanwhile, tonal words regarding 50 companies listed in the SET50 index are extracted from selected online/social media using search engines and an application programming interface (API). Note that 50 firms included in the analyses and index construction change every six months following the announcement of the SET. The daily data will be collected for the period 2015 - 2018. Machine Learning algorithms are employed to conduct a bag-of-word analysis which will count the number of positive, negative and neutral tonal words for 50 firms in a set of articles over a 4-year period. The findings show that the introduced sentiment index has significant relationships with the changes in SET50 index. The sentiment index can signal changes in SET50 index from day t to day t+3. The analysis results indicate that this sentiment index can act as a leading indicator of the SET50 index and thus the SET index.
Stock price return, Prediction, Sentiment Analysis, Text Mining, NLP, Natural Language Processing
1-42
Tantisantiwong, Nongnuch
73b57288-a4dc-4456-8d1b-12b8d07dc3b4
Komenkul, Kulabutr
34dd9074-4e86-4141-b2bf-3e9cf16309ed
Channuntapipat, Charika
6ff9a719-1863-4fa3-9977-670fe55c7e53
Jeamwatthanachai, Watthanasak
08576ac1-124d-4bfa-8ca2-49e6663161c3
Tantisantiwong, Nongnuch
73b57288-a4dc-4456-8d1b-12b8d07dc3b4
Komenkul, Kulabutr
34dd9074-4e86-4141-b2bf-3e9cf16309ed
Channuntapipat, Charika
6ff9a719-1863-4fa3-9977-670fe55c7e53
Jeamwatthanachai, Watthanasak
08576ac1-124d-4bfa-8ca2-49e6663161c3

Tantisantiwong, Nongnuch, Komenkul, Kulabutr, Channuntapipat, Charika and Jeamwatthanachai, Watthanasak (2020) Capturing investor sentiment from Big Data: the effects of online social media on SET50 index. CM Research Innovation, 2020 (4), 1-42, [1].

Record type: Article

Abstract

This research aims to introduce a market sentiment index which can be used as a leading indicator for the Stock Exchange of Thailand (SET). This new index is constructed from Big Data extracted from online social media. The data used in this project are the daily firm-level and market data, market capitalizations of firms and the SET, the total trading value in the SET and the values of stocks bought and sold by investor types, the data on S&P global index, the implied volatility index of S&P500 (known as VIX) and the exchange rate of Thai Baht. Meanwhile, tonal words regarding 50 companies listed in the SET50 index are extracted from selected online/social media using search engines and an application programming interface (API). Note that 50 firms included in the analyses and index construction change every six months following the announcement of the SET. The daily data will be collected for the period 2015 - 2018. Machine Learning algorithms are employed to conduct a bag-of-word analysis which will count the number of positive, negative and neutral tonal words for 50 firms in a set of articles over a 4-year period. The findings show that the introduced sentiment index has significant relationships with the changes in SET50 index. The sentiment index can signal changes in SET50 index from day t to day t+3. The analysis results indicate that this sentiment index can act as a leading indicator of the SET50 index and thus the SET index.

Text
Capturing Investor Sentiment from Big Data - Version of Record
Available under License Creative Commons GNU GPL (Software).
Download (1MB)

More information

Published date: 20 April 2020
Keywords: Stock price return, Prediction, Sentiment Analysis, Text Mining, NLP, Natural Language Processing

Identifiers

Local EPrints ID: 441178
URI: http://eprints.soton.ac.uk/id/eprint/441178
PURE UUID: 68d43927-dcf3-40ea-80b5-38e0e35b71b1

Catalogue record

Date deposited: 03 Jun 2020 16:32
Last modified: 16 Mar 2024 08:07

Export record

Contributors

Author: Nongnuch Tantisantiwong
Author: Kulabutr Komenkul
Author: Charika Channuntapipat
Author: Watthanasak Jeamwatthanachai

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×