Cross-domain sentiment analysis on social media interactions using senti-lexicon based hybrid features
Cross-domain sentiment analysis on social media interactions using senti-lexicon based hybrid features
Analyzing the sentiment information from the social media interactions is a rapidly growing research area. Several studies in the literature focus on modeling the sentiment information using linguistics, generic word counts and even the contextual information, including the presence of punctuations, elongated words, emoticons, etc. In this paper, we experiment on the effectiveness of lexicon information in combination with other information, for the effective analysis of sentiment in social interactions. The objective of this study is to experimentally verify how senti-lexicons can take part in the process of modeling the sentiment information even in cross-domain sentiment analysis. In general, this paper explores the effectiveness of several feature vectors including the generic Bag of Word (BoW), linguistic (N-Gram and Part-of-Speech (POS)) and the lexicon features (number of positive and negative words). Other than the traditional features we generate hybrid features by combining the lexicon features with the BoW and linguistic features. We conduct the experiments on sentiment classification using supervised models like Linear SVC (L-SVC), Multi-Layer Perceptron (MLP), Multinomial Naïve Bayes (MNB) and Decision Tree (DT). The experiments are conducted on three different types of sentiment document datasets - the Amazon food review dataset, student opinion tweet dataset, and the Large Movie Review Dataset v1.0. We also verify the efficacy of these features in cross-domain sentiment analysis. Experiments show that hybridizing the BoW, linguistic N-Gram and POS method with lexicon features improves the accuracy of sentiment classification even for cross-domain sentiment analysis.
Sentiment Analysis, Machine Learning, Feature extraction, Senti-lexicons
772-777
Rajagopal, Suharshala
0b281bba-0ab3-43fc-89c2-cf67920cf11d
Kadan, Anoop
9cc17e26-a329-49fe-b73b-2fce75084966
V.L., Lajish
e7f39205-51be-4d69-8fc1-4c7b3feddef7
2020
Rajagopal, Suharshala
0b281bba-0ab3-43fc-89c2-cf67920cf11d
Kadan, Anoop
9cc17e26-a329-49fe-b73b-2fce75084966
V.L., Lajish
e7f39205-51be-4d69-8fc1-4c7b3feddef7
Rajagopal, Suharshala, Kadan, Anoop and V.L., Lajish
(2020)
Cross-domain sentiment analysis on social media interactions using senti-lexicon based hybrid features.
In 2018 3rd International Conference on Inventive Computation Technologies (ICICT).
IEEE.
.
(doi:10.1109/ICICT43934.2018.9034272).
Record type:
Conference or Workshop Item
(Paper)
Abstract
Analyzing the sentiment information from the social media interactions is a rapidly growing research area. Several studies in the literature focus on modeling the sentiment information using linguistics, generic word counts and even the contextual information, including the presence of punctuations, elongated words, emoticons, etc. In this paper, we experiment on the effectiveness of lexicon information in combination with other information, for the effective analysis of sentiment in social interactions. The objective of this study is to experimentally verify how senti-lexicons can take part in the process of modeling the sentiment information even in cross-domain sentiment analysis. In general, this paper explores the effectiveness of several feature vectors including the generic Bag of Word (BoW), linguistic (N-Gram and Part-of-Speech (POS)) and the lexicon features (number of positive and negative words). Other than the traditional features we generate hybrid features by combining the lexicon features with the BoW and linguistic features. We conduct the experiments on sentiment classification using supervised models like Linear SVC (L-SVC), Multi-Layer Perceptron (MLP), Multinomial Naïve Bayes (MNB) and Decision Tree (DT). The experiments are conducted on three different types of sentiment document datasets - the Amazon food review dataset, student opinion tweet dataset, and the Large Movie Review Dataset v1.0. We also verify the efficacy of these features in cross-domain sentiment analysis. Experiments show that hybridizing the BoW, linguistic N-Gram and POS method with lexicon features improves the accuracy of sentiment classification even for cross-domain sentiment analysis.
This record has no associated files available for download.
More information
Published date: 2020
Venue - Dates:
International Conference on Inventive Computation Technologies, RVS Technical Campus, Tamil Nadu, India, 2018-11-15 - 2018-11-16
Keywords:
Sentiment Analysis, Machine Learning, Feature extraction, Senti-lexicons
Identifiers
Local EPrints ID: 494591
URI: http://eprints.soton.ac.uk/id/eprint/494591
PURE UUID: ad1038ed-f596-4869-b818-55444226f559
Catalogue record
Date deposited: 10 Oct 2024 17:01
Last modified: 31 Oct 2024 03:15
Export record
Altmetrics
Contributors
Author:
Suharshala Rajagopal
Author:
Anoop Kadan
Author:
Lajish V.L.
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics