Capturing investor sentiment from Big Data: the effects of online social media on SET50 index
Capturing investor sentiment from Big Data: the effects of online social media on SET50 index
This research aims to introduce a market sentiment index which can be used as a leading indicator for the Stock Exchange of Thailand (SET). This new index is constructed from Big Data extracted from online social media. The data used in this project are the daily firm-level and market data, market capitalizations of firms and the SET, the total trading value in the SET and the values of stocks bought and sold by investor types, the data on S&P global index, the implied volatility index of S&P500 (known as VIX) and the exchange rate of Thai Baht. Meanwhile, tonal words regarding 50 companies listed in the SET50 index are extracted from selected online/social media using search engines and an application programming interface (API). Note that 50 firms included in the analyses and index construction change every six months following the announcement of the SET. The daily data will be collected for the period 2015 - 2018. Machine Learning algorithms are employed to conduct a bag-of-word analysis which will count the number of positive, negative and neutral tonal words for 50 firms in a set of articles over a 4-year period. The findings show that the introduced sentiment index has significant relationships with the changes in SET50 index. The sentiment index can signal changes in SET50 index from day t to day t+3. The analysis results indicate that this sentiment index can act as a leading indicator of the SET50 index and thus the SET index.
Stock price return, Prediction, Sentiment Analysis, Text Mining, NLP, Natural Language Processing
1-42
Tantisantiwong, Nongnuch
73b57288-a4dc-4456-8d1b-12b8d07dc3b4
Komenkul, Kulabutr
34dd9074-4e86-4141-b2bf-3e9cf16309ed
Channuntapipat, Charika
6ff9a719-1863-4fa3-9977-670fe55c7e53
Jeamwatthanachai, Watthanasak
08576ac1-124d-4bfa-8ca2-49e6663161c3
20 April 2020
Tantisantiwong, Nongnuch
73b57288-a4dc-4456-8d1b-12b8d07dc3b4
Komenkul, Kulabutr
34dd9074-4e86-4141-b2bf-3e9cf16309ed
Channuntapipat, Charika
6ff9a719-1863-4fa3-9977-670fe55c7e53
Jeamwatthanachai, Watthanasak
08576ac1-124d-4bfa-8ca2-49e6663161c3
Tantisantiwong, Nongnuch, Komenkul, Kulabutr, Channuntapipat, Charika and Jeamwatthanachai, Watthanasak
(2020)
Capturing investor sentiment from Big Data: the effects of online social media on SET50 index.
CM Research Innovation, 2020 (4), , [1].
Abstract
This research aims to introduce a market sentiment index which can be used as a leading indicator for the Stock Exchange of Thailand (SET). This new index is constructed from Big Data extracted from online social media. The data used in this project are the daily firm-level and market data, market capitalizations of firms and the SET, the total trading value in the SET and the values of stocks bought and sold by investor types, the data on S&P global index, the implied volatility index of S&P500 (known as VIX) and the exchange rate of Thai Baht. Meanwhile, tonal words regarding 50 companies listed in the SET50 index are extracted from selected online/social media using search engines and an application programming interface (API). Note that 50 firms included in the analyses and index construction change every six months following the announcement of the SET. The daily data will be collected for the period 2015 - 2018. Machine Learning algorithms are employed to conduct a bag-of-word analysis which will count the number of positive, negative and neutral tonal words for 50 firms in a set of articles over a 4-year period. The findings show that the introduced sentiment index has significant relationships with the changes in SET50 index. The sentiment index can signal changes in SET50 index from day t to day t+3. The analysis results indicate that this sentiment index can act as a leading indicator of the SET50 index and thus the SET index.
Text
Capturing Investor Sentiment from Big Data
- Version of Record
More information
Published date: 20 April 2020
Keywords:
Stock price return, Prediction, Sentiment Analysis, Text Mining, NLP, Natural Language Processing
Identifiers
Local EPrints ID: 441178
URI: http://eprints.soton.ac.uk/id/eprint/441178
PURE UUID: 68d43927-dcf3-40ea-80b5-38e0e35b71b1
Catalogue record
Date deposited: 03 Jun 2020 16:32
Last modified: 05 Jun 2024 17:55
Export record
Contributors
Author:
Nongnuch Tantisantiwong
Author:
Kulabutr Komenkul
Author:
Charika Channuntapipat
Author:
Watthanasak Jeamwatthanachai
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics