The University of Southampton
University of Southampton Institutional Repository

Cross-domain sentiment analysis on social media interactions using senti-lexicon based hybrid features

Cross-domain sentiment analysis on social media interactions using senti-lexicon based hybrid features
Cross-domain sentiment analysis on social media interactions using senti-lexicon based hybrid features
Analyzing the sentiment information from the social media interactions is a rapidly growing research area. Several studies in the literature focus on modeling the sentiment information using linguistics, generic word counts and even the contextual information, including the presence of punctuations, elongated words, emoticons, etc. In this paper, we experiment on the effectiveness of lexicon information in combination with other information, for the effective analysis of sentiment in social interactions. The objective of this study is to experimentally verify how senti-lexicons can take part in the process of modeling the sentiment information even in cross-domain sentiment analysis. In general, this paper explores the effectiveness of several feature vectors including the generic Bag of Word (BoW), linguistic (N-Gram and Part-of-Speech (POS)) and the lexicon features (number of positive and negative words). Other than the traditional features we generate hybrid features by combining the lexicon features with the BoW and linguistic features. We conduct the experiments on sentiment classification using supervised models like Linear SVC (L-SVC), Multi-Layer Perceptron (MLP), Multinomial Naïve Bayes (MNB) and Decision Tree (DT). The experiments are conducted on three different types of sentiment document datasets - the Amazon food review dataset, student opinion tweet dataset, and the Large Movie Review Dataset v1.0. We also verify the efficacy of these features in cross-domain sentiment analysis. Experiments show that hybridizing the BoW, linguistic N-Gram and POS method with lexicon features improves the accuracy of sentiment classification even for cross-domain sentiment analysis.
772-777
IEEE
Rajagopal, Suharshala
0b281bba-0ab3-43fc-89c2-cf67920cf11d
Kadan, Anoop
9cc17e26-a329-49fe-b73b-2fce75084966
V.L., Lajish
e7f39205-51be-4d69-8fc1-4c7b3feddef7
Rajagopal, Suharshala
0b281bba-0ab3-43fc-89c2-cf67920cf11d
Kadan, Anoop
9cc17e26-a329-49fe-b73b-2fce75084966
V.L., Lajish
e7f39205-51be-4d69-8fc1-4c7b3feddef7

Rajagopal, Suharshala, Kadan, Anoop and V.L., Lajish (2020) Cross-domain sentiment analysis on social media interactions using senti-lexicon based hybrid features. In 2018 3rd International Conference on Inventive Computation Technologies (ICICT). IEEE. pp. 772-777 . (doi:10.1109/ICICT43934.2018.9034272).

Record type: Conference or Workshop Item (Paper)

Abstract

Analyzing the sentiment information from the social media interactions is a rapidly growing research area. Several studies in the literature focus on modeling the sentiment information using linguistics, generic word counts and even the contextual information, including the presence of punctuations, elongated words, emoticons, etc. In this paper, we experiment on the effectiveness of lexicon information in combination with other information, for the effective analysis of sentiment in social interactions. The objective of this study is to experimentally verify how senti-lexicons can take part in the process of modeling the sentiment information even in cross-domain sentiment analysis. In general, this paper explores the effectiveness of several feature vectors including the generic Bag of Word (BoW), linguistic (N-Gram and Part-of-Speech (POS)) and the lexicon features (number of positive and negative words). Other than the traditional features we generate hybrid features by combining the lexicon features with the BoW and linguistic features. We conduct the experiments on sentiment classification using supervised models like Linear SVC (L-SVC), Multi-Layer Perceptron (MLP), Multinomial Naïve Bayes (MNB) and Decision Tree (DT). The experiments are conducted on three different types of sentiment document datasets - the Amazon food review dataset, student opinion tweet dataset, and the Large Movie Review Dataset v1.0. We also verify the efficacy of these features in cross-domain sentiment analysis. Experiments show that hybridizing the BoW, linguistic N-Gram and POS method with lexicon features improves the accuracy of sentiment classification even for cross-domain sentiment analysis.

This record has no associated files available for download.

More information

Published date: 2020
Venue - Dates: International Conference on Inventive Computation Technologies, RVS Technical Campus, Tamil Nadu, India, 2018-11-15 - 2018-11-16

Identifiers

Local EPrints ID: 494591
URI: http://eprints.soton.ac.uk/id/eprint/494591
PURE UUID: ad1038ed-f596-4869-b818-55444226f559
ORCID for Anoop Kadan: ORCID iD orcid.org/0000-0002-4335-5544

Catalogue record

Date deposited: 10 Oct 2024 17:01
Last modified: 11 Oct 2024 02:10

Export record

Altmetrics

Contributors

Author: Suharshala Rajagopal
Author: Anoop Kadan ORCID iD
Author: Lajish V.L.

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×