A Comprehensive Review On Sentiment Analysis
A Comprehensive Review On Sentiment Analysis
Abstract Sentiment analysis (SA) is an emerging field in text mining. It is the pro-
cess of computationally identifying and categorizing opinions expressed in a piece
of text over different social media platforms. Social media plays an essential role
in knowing the customer mindset towards a product, services, and the latest mar-
ket trends. Most organizations depend on the customer’s response and feedback to
upgrade their offered products and services. SA or opinion mining seems to be a
promising research area for various domains. It plays a vital role in analyzing big
data generated daily in structured and unstructured formats over the internet. This
survey paper defines sentiment and its recent research and development in different
domains, including voice, images, videos, and text. The challenges and opportunities
of sentiment analysis are also discussed in the paper.
Sudhanshu Kumar
E-mail: skumar2@cs.iitr.ac.in
Partha Pratim Roy
E-mail: partha@cs.iitr.ac.in
Debi Prosad Dogra
E-mail: dpdogra@iitbbs.ac.in
Byung-Gyu Kim
E-mail: bg.kim@sookmyung.ac.kr
∗ Corresponding Author
2 Sudhanshu Kumar∗1 et al.
1 Introduction
1. A large number of literature has been reviewed in sentiment analysis process from
multiple domains and identify the pros and cons of all approaches.
A Comprehensive Review on Sentiment Analysis: Tasks, Approaches and Applications 3
Fig. 1: The process to classify the review into positive, negative, neutral using Ma-
chine learning.
2. Summarizing each of the surveyed articles in detail, including the problems ad-
dressed, dataset details and methods.
3. Analyses of existing applications in order to determine which one is most suitable
for certain application.
4. Discussing the challenges and application of sentiment analysis in order to keep
up the current research trends.
The rest of the paper is organized as follows. In Section 2, we discuss the state of
the art discussion on SA. Detailed discussion on the existing work, open issues, and
possible applications of sentiment analysis is presented in Section 3 and Section 4.
Finally, in the last Section 5 the research work has been concluded.
4 Sudhanshu Kumar∗1 et al.
Fig. 2: The process to classify the review into positive, negative and neutral using
Deep learning techniques.
Fig. 3: All sentiment analysis classification techniques from traditional to the latest
one have been shown in this figure. Initially, It is divided into machine learning and
lexicon-based approach, which further divide into different algorithms.
A Comprehensive Review on Sentiment Analysis: Tasks, Approaches and Applications 5
Fig. 4: The number of research papers with the Scopus index are published in the last
ten years from 2011 to Mid 2020. The number of publications is increasing yearly as
the advancement in technology and the evolution of industry 4.0.
Opinion, views, and feeling are often used interchangeably in the different literature
[70]. SA is also related to many terms such as emotions, moods, and feelings, which
sometimes confuse the reader with opinion or SA. Emotion is related to the percep-
tion of the stimulus and the triggering of the bodily response. For example, one person
shows an angry response when they lose the job, and in another case, they feel joy in
the same situation. Many authors [70, 93] showed that emotion is short term, while
the mood is a long-term phenomenon. They differentiate on both terms basis on their
duration.
A schematic representation of opinion and sentiment are given in Figure 5.
SA has raised a growing interest in financial and political forecasting, e-health,
e-tourism, and dialogue systems. The authors [36] proposed trust-based ranking and
the recommendation tool to improve online software services recommendation. This
system enhances the existing recommendation system (content-based and collabora-
tive filtering based) algorithm by considering the external attributes. The proposed
system result was evaluated on the Amazon marketplace review dataset and showed
a better ranking.
SA used many libraries such as TextBlob 1 and naive Bayes to classify the content
based on polarity score and subjectivity. In [35], the authors proposed a sentiment
polarity categorization process, which was the fundamental problem of sentiment
earlier. The amazon product reviews dataset’s experimental results achieved an F1
score of 0.8 and .0.73 for sentence-level and review-level categorization, respectively.
The polarity shift problem is one of the challenges in sentiment analysis to predict
1 https://textblob.readthedocs.io/en/dev/
6 Sudhanshu Kumar∗1 et al.
Fig. 5: Schematic structure of sentiments [70], which further divide into sentiment
holder, emotional disposition and object of the review. The emotion is short-term,
while the mood is long-term disposition.
user reviews. Xia et al. [114] proposed dual sentiment analysis model to address
the polarity shift problem. The model trained for sentiment-reversed review for both
training and testing. The result of the multi-domain and Chinese datasets showed the
effectiveness of the model.
In [25], the authors proposed a map reducing paradigm to collect the user’s data
from Facebook to understand brand reviews. They refined their approach through
the iterative process of data pre-processing. In [6], the authors proposed an auto-
matic feedback technique based on Twitter data. Different classifiers like SVM, Naive
Bayes, and maximum entropy are used on Twitter comments. Out of these classifiers,
SVM-based performance was the highest. In [102], the authors proposed a random
walk algorithm for domain-oriented sentiment lexicon based on utilizing sentiment
words and documents from both the old and target domains. The proposed algorithm
reflects four kinds of relationships (words to documents, words to words, documents
to words, and documents to documents) between words and documents. Experimen-
tal results indicate improvements in identifying the polarities of sentiment words.
Day et al. [26] presented that analytical methods use deep learning in financial news
sources to forecast stock price trends. The authors found that financial news media
sources can reveal investment information. Sentiment analysis aims to classify text in
positive and negative polarity scores useful in quantifying different affective states of
a user [83]. Cambria et al. [18] have developed an NLP approach that leverages both
data and theory-driven methods to understand natural language.
Many approaches have simple categorization problems; however, sentiment analysis
is a big suitcase problem requiring multiple polarity detection tasks. NLP problems
divide into three layers: syntactic, semantics, and pragmatics. Each layer has a differ-
ent subtask to process each layer’s text output as input for the next layer. In [81], the
A Comprehensive Review on Sentiment Analysis: Tasks, Approaches and Applications 7
authors have developed a pre-trained model for extracting emotion, sentiment, and
personality features from sarcastic tweets using CNN. Experiments were conducted
on a dataset consisting of both sarcastic and non-sarcastic tweets. Results computed
on three datasets with F1 scores of 87%, 92.32%, and 93.30%, respectively.
Text is mainly an important medium to express the user’s state of mind in reviews
and comments on the internet. It was the primary mode of communication in early
1990 when e-commerce company amazon was the first company to do business on-
line. In 1992, the authors [44] proposed an approach based on the sentence’s direc-
tionality. The approach is based on the semantic orientation of the sentence to de-
termine the directionality of the text. Another researcher [89] whose theory is based
on the information’s subjective point of view. In the early 2000s, many researchers
[23, 69, 77, 106, 107, 112] worked on sentiments analysis and opinions mining. Na-
sukawa et al. [73] showed the high precision result on customer reviews and news ar-
ticles available over web pages. They classified the specific subjects from a document
in positive or negative polarity. This paper’s result rise in interest to other researchers
in this domain. The influential 2008 review of Pang and Lee [76] covers techniques
and approaches that promise to directly enable opinion-oriented information-seeking
systems on benchmark datasets in recent research. Here, we discuss sentiment analy-
sis in NLP, including its different methods, such as supervised and unsupervised.
It is based on the annotated dataset (labeled data) to build a prediction model. This
approach builds a feature vector of the text, either aspect or word frequency, then the
model learns (training) on the dataset and gives prediction for unseen data in testing.
The first paper [111] used this approach to classify the text as subjective or objective
on the gold standard dataset. It achieves 81.5% accuracy on the probabilistic classi-
fier. There are different approaches in machine learning, like the supervised and unsu-
pervised approaches. The supervised approach was used on stock trading [24] domain
to find the sentiment analysis. Further development focus on user comments available
on the E-commerce site. Many machine learning algorithms like SVM, Naive Bayes,
and linear regression solve the problem related to a different domain. SVM was the
most suitable model for product reviews in supervised sentiment analysis.
The authors [30] proposed BERT (bidirectional encoder representations from
transformers) model designed to pre-trained deep bidirectional from the unlabelled
text by jointly conditioning on both left and right context. Bidirectional means that
BERT learns information from both the left and the right side of a token’s context
during the training phase. The model is implemented on eleven NLP tasks such as
GLUE, MultiNLI, SWAG, SQuAD v1.1, etc., and shown the impressive state-of-the-
art result in the SA field. Unlike recent language representation models [80, 86], this
model is used for a wide range of tasks like question answering and language infer-
ence.
8 Sudhanshu Kumar∗1 et al.
In this approach, the labeled data is not present, allowing an estimation based on ex-
pert knowledge. The most popular method in unsupervised learning is cluster analysis
to find the data’s hidden pattern. In sentiment analysis, the lexicon plays an essential
role in classifying the text into positive, negative, and neutral depending on the lexi-
con method, a combination of words or phrases. The most popular lexicon is General
Inquirer [100], it is a corpus of positive and negative terms.
The aim to improve sentence-level classification, recently few methods are per-
forming well such as SentiStrength [104], Valence Aware Dictionary and sEntiment
Reasoner (VADER) [46], and Umigon [59]. VADER is a lexicon and rule-based sen-
timent analysis method used to find the sentiment of the reviews available on dif-
ferent social media platforms. The reviews are shared by a user from different age
groups and gender, so these reviews are not available in the form that can be directly
processed by any method. It converted into a normalized form after pre-processing
of these reviews. VADER evaluates the words and their context on pre-processed re-
views based on a predefined dictionary with many words (sentiment lexicon) and their
corresponding numeric score. The method produces four metrics for each review; the
first three are positive, neutral, and negative. The last metric is a compound score
used to identify these reviews’ sentiment. VADER is popular among other methods
to analyze social media posts with slang, emoticons, and acronyms.
Qazi et al. [85] proposed assessing users’ opinions on multiple topics like a social get-
together, promoting efforts, and item inclinations. This study aims to find the users’
expectations and satisfaction at the post-purchase stage. They surveyed a question-
naire comprising seven sections, and the data was collected through LinkedIn and the
university mail servers. The authors utilized a disconfirmation hypothesis, a set of
seven theories, confirmatory factor analysis, and primary conditions to break down
the users’ information and assess the model. The model’s consequences demonstrated
that regular, comparative, and interesting assessments positively raise users’ desires.
The author presumed that a wide range of sentiments is a rich wellspring of data that
at last influences the customer loyalty level. Wang et al. [108] proposed a SentiRe-
lated algorithm to fill the gap between different domains. The traditional supervised
classification algorithm is performed well for a given domain but does not work well
on different domains. The SentiRelated algorithm is based on the Sentiment Related
Index to improve the model’s performance when tested on other domains. This al-
gorithm was validated on two datasets with different domains such as a computer,
Education, Hotel, Movie, Music, and Book reviews and showed 80% accuracy for
short texts. Social media platforms like Twitter are trending to become a common
platform for exchanging raw data and online text, providing a vast platform for sen-
timent analysis.
The author proposed [75] a novel metaheuristic method based on Cuckoo Search
and K-means (called CSK). It enlightens the clustering-based methods for analysing
Twitter tweets to find the user’s viewpoints and the sentiment pertained while mak-
ing such a tweet. The method proposed outlines to find the optimum cluster-heads
from the Twitter dataset’s sentimental contents. The model tested its efficacy on var-
ious Twitter datasets and then compared it with the existing methods such as particle
swarm optimization, differential evolution, cuckoo search, improved cuckoo search,
10 Sudhanshu Kumar∗1 et al.
etc. This research work performed a basis for designing a system that quickly pro-
vides conclusive reviews on any social issues.
The authors [90] discussed an approach where a publicized tweets from the Twitter
site are processed and classified based on their polarity. In this paper, a new tool is
called ”SENTICIRCLE” is used a lexicon-based approach. The word’s semantics are
extracted from its co-occurrence pattern, and the strength is updated in the lexicon
accordingly. The basic idea of the approach is that the group of word accompany-
ing it decides the semantic of the word in any text. The force of movement from the
static word sentiment orientation approach to this contextual approach derived from
the dictum ”YOU SHALL KNOW THE WORD BY THE COMPANY IT KEEPS.”
It is different from the traditional lexicon, where the words are given fixed static se-
mantics regardless of the context.
Many studies have been conducted in sentiment analysis on English texts, while other
languages have less attention than Arabic, Hindi, Bangla, etc. Many researchers have
worked on SA in different languages after the rise of Web 2.0. The author in [3] intro-
duced an overview on Arabic assessment analysis [8, 4, 97] in which they examined
various tools and applications pertinent to it. The study additionally included both
corpus-based and dictionary-based approaches for different datasets. Microblogs like
tweets are trending rapidly for online users to share their experiences and opinions
daily. In contrast to the online reviews and blogs, these microblogs contain very dis-
persed and incomplete data. Unlike English-based microblogs, Chinese microblogs
such as Sina Weibo have less sentiment analysis. The reason being that Chinese tex-
tual analysis is more challenging than English as its grammar of expression is differ-
ent. The same length of Chinese sentences may contain more data than English, and
the separation of words in those texts is relatively obscure. In totality, textually ana-
lyzing Chinese blogs has three primary research goals: First, the new words mining
and their sentiment inference; second, how to extract other media modules and third,
establish a hierarchical sentiment detection method based on Sina Weibo linguistics.
The authors [110] proposed three primary goals for the analysis of these Chinese
microblogs. They visualize the sentiment analysis’s result, depicting the relationship
between social network sentiments and real-life events.
The researchers already working on Chinese microblogs tend to analyze the topic fo-
cussing on a single attribute while neglecting others. The model design is multilevel
in single-level features keeping all the aspects under consideration. Chen et al. [21]
used to extract the text’s sentiment using sentence-level sentiment analysis, but un-
like other traditional approaches where the same technique was used in all types of
sentences. The sentences are classified into three groups based upon their opinion tar-
gets. There are other ways to classify the sentences that have been previously used in
other research papers. For example, The sentence can be subjective or objective based
upon the subjectivity of the sentences. The subjective sentences express the opinions,
while objective sentences implicate opinions or sentiments. The opinionated targets
focused on the primary sentence classification. This opinion target can be any en-
A Comprehensive Review on Sentiment Analysis: Tasks, Approaches and Applications 11
tity on which opinion is expressed. These opinionated sentences can give an opinion
without mentioning the target on three different types of sentences: non-target, one-
target, and multi-target. The Bi-LSTM and CNN deep learning approaches were used
to classify the sentences and extract the text’s syntactic and semantic features.
Analysis of speech in search of emotional and affective cues has a comparably long
tradition [28]. This paper proposed statistical pattern recognition techniques to clas-
sify 1000 utterances according to their emotional content. Meanwhile, several kinds
of literature have been established, including a range of recent surveys in emotions
and affect in speech [95]. However, targeting sentiment explicitly exclusively from
spoken utterances is a comparably new field than text-based sentiment analysis. Fo-
cusing on the acoustic side of spoken language, the border between sentiment and
emotion analysis is often fragile, as discussed in [22]. Mairesse et al. [63] focused
on pitch-related features and observed that pitch contains information on sentiment
without textual cues. The authors collected short-spoken reviews from 84 speakers,
and the result outperformed a majority class baseline. This paper attracted other re-
searchers to explore this area to solve real-world problems. The authors [34] created
Arabic Speech Act and Sentiment (ArSAS) dataset. The dataset consisted of 21,064
tweets annotated for two tasks: speech act recognition and SA. Further, the tweets
are annotated for four different sentiment categories: positive, negative, neutral, and
mixed.
Ahmed et al. [2] showed the sentiment in phone calls by first using speech recogni-
tion to extract the text in the call and then use typical text-based SA techniques. The
goal was to measure agent productivity in call centers.
Vision-based emotion recognition [123, 92, 19] is a relatively recent area of research.
Users share millions of images and videos over social media platforms like Twit-
ter, Tumblr, Flickr, and Instagram. These are the most popular sites where celebri-
ties from sports, entertainment, and politics field share information in images. In the
image-based sentiment analysis, opinions depict in the form of cartoons or memes.
In most cases, the information conveyed through images is more effective compared
to other modalities. Multiple techniques and algorithms such as SVM, naive Bayes,
maximum entropy, and deep learning have been proposed in the image-based senti-
ment area to get significant results. The first work introduced by [65] to classify the
images into positive and negative. The author showed that there is a strong correlation
between sentiment images and their visual content.
Further, the SentiWordNet lexicon [74] was used to find the text’s numerical scores
associated with the image. This lexicon is used in WorldNet databases to identify
the positive and negative sentiment of the word. Emotions are difficult to identify
and pin down when discussing the state of the emotion that differentiates from other
12 Sudhanshu Kumar∗1 et al.
emotional states. To find the scientific approach regarding the emotional state of the
human being. The database of different photos was collected and validated against
the specific emotional response of the viewers. This database is called International
Affective Picture System (IAPS). Mikels et al. [65] studied eight emotion output cat-
egories: awe, anger, amusement, contentment, excitement, disgust, sadness, and fear.
The author showed that each emotional state is different as different emotions have
other cognitive and behavioral consequences. This paper adds some new dimensions
of data for IAPS.
Multimodal sentiment analysis [56] performs sentiment analysis from multiple data
such as audio, video, and text. It is the new dimension of traditional text-based
sentiment analysis. Poria et al. [82] proposed a new multimodal sentiment analysis
methodology, which outperformed state of the art by more than 20%. The proposed
system used feature-based fusion techniques on text, visual and audio data from the
youtube dataset. Different classifiers such as Naive Bayes, SVM, and extreme learn-
ing machines are implemented on the youtube dataset. The results showed that the
extreme learning classifier is better than other classifiers. Extreme learning classifiers
have single layer or multiple layers of hidden nodes. In most cases, the weights of the
hidden nodes are learned in a single step so the overall processing time to classifying
the result is less.
Kumar et al. [54] proposed a multimodal rating prediction framework for prod-
ucts to improve customer satisfaction. The forty participants were participated in this
study to collect EEG data of the product. The text’s reviews from the product are
processed through NLP techniques. The customer’s rating from EEG and the prod-
uct’s reviews fused through optimization techniques. The experiment result showed
that the ABC optimization approach was better than the unimodal scheme. There
are many languages other than English, where researchers are working to predict the
sentiment [116, 120, 12, 125, 113]. In [52], the authors proposed a hybrid approach
on the Arabic dataset (Text and audio). Two machine learning approaches were used
on this dataset to find the polarity. The bagging and boosting algorithms were used
to enhance the proposed system further. The summary of the reviewed articles is as
shown in Table 1.
There are many public datasets [41, 105] available in SA. For different application
such as text, images, audio and video, we use different datasets or create a dataset like
Sanskrit dataset [53]. The tools and software library is also depend on the multimodal
data. Open CV is a open source library used in computer vision tasks for object
dection, face recognition and image segmentation. NLTK is a python library used for
understanding the text or speech. Below is some popular database as shown in Table
2.
A Comprehensive Review on Sentiment Analysis: Tasks, Approaches and Applications 13
Table 1: Summary of the publication’s details included author, approach, data set, and
accuracy.
Author & Year Approach Dataset Accuracy (%)
Hearst et al. [14], 1992 Cognitive Linguistics User’s Query -
Wiebet et al. [21], 1999 Probabilistic Classifier Gold-standard -
Camera reviews,
Nasukawa et al. [22], 2003 Sentiment Lexicon 75-95
news articles
Supervised and Unsupervised
Pang et al. [23], 2008 - -
approach (Survey paper)
Electronics reviews,
Domain-Oriented
Tan et al. [11], 2011 Stock reviews 82.9
sentiment lexicon
and Hotel review
sentiment polarity amazon product
Fang et al. [9], 2015 80
categorization process reviews
Amazon
Trust-based ranking
Gallege et al. [8], 2016 marketplace -
and the recommendation
review
Naive Bayes, linear SVM, Multi-Domain and
Xia et al. [10], 2015 90
logistic regression Chinese dataset
1500 Arabic
Khasawneh et al. [12], 2015 Bagging and Boosting comments and -
Twitter reviews
Campos et al. [65], 2017 CNN Twitter images -
Poria et al. [13], 2016 ELM classifier YouTube Dataset -
Penn Treebank,
Cambria et al. [1], 2017 Top-Down and Bottom-Up -
LIWC
EEG data and
Kumar et al. [4], 2019 ABC optimization -
product reviews
Expectancy disconfirmation LinkedIn and
Qazi et al. [47], 2017 theory and Confirmatory the university mail -
factor analysis servers’ groups
Raw Data and
Wang et al. [48], 2018 SentiRelated 80
Douban Data
intermediate-level
Williams et al. [48], 2018 MOSI dataset 74.0
feature fusion
Accuracy 93.5
book reviews
Precision 93
Yang et al. [48], 2020 SLCABG collected from
Recall 93.6
Dangdang dataset
F1 93.3
Precision 91.54
15000 hotel
Xu et al. [48], 2019 Seninfo+TF-IDF Recall 92.82
comment texts
F1 92.18
Multimodal Corpus of
Lakomkin et al. [48], 2019 ASR model 73.6
Sentiment Intensity
CNN-BiGRU-CTC + Aishell-1
Guo et al. [48], 2022 94.5
ERNIE-BiLSTM and NLPCC 2014
Kumar et al. [48], 2022 BiLSTM + GloVe IIT-R STSA 92.83
BERT-LARGE + A-KVMN LAP14, REST14,
Tian et al. [48], 2021 with second-order REST15, REST16 92.48
word dependencies and Twitter
Movie review,
Behera et al. [48], 2021 Co-LSTM model Airline dataset 98.40
Self driving car GOP
Facebook corpus Precision 57.95
Attention-based
Zhao et al. [48], 2020 containing user Recall 65.78
LSTM model
personality tag F1 72.2
English dataset
average 56.24
Derakhshan et al. [48], 2019 LDA-POS model English and Persian
Persian dataset
average 55.33
This section covers the wide range of applications of sentiment analysis in various
emerging areas.
Table 2: List of most popular public datasets available in the sentiment analysis field
in different languages and modalities. The details included the source and uses of the
dataset like the movie reviews, product reviews, social media data, etc.
Research Paper Dataset Use of this dataset Volume
Bai et al. [10] IMDB Movie To analysis movie review 50,000 movie reviews
Google 640917,
To do the sentiment analysis Microsoft 161292
Li et al. [60] Twitter
of tweets of different domain and Sony 141529
tweets
Sentiment analysis of tweets
Araque et al. [7] Sentiment140 1.6 million tweets
for a product or brand
Amazon Product To classify user review in
Dredze et al. [15] 142.8 million review
Reviews positive and negative
To find the aspect based 3 million restaurant
Qian et al. [58] Restaurant Reviews
sentiment analysis reviews
To do the sentiment analysis million Facebook users
Karyotis et al. [51] Facebook
of Facebook post and their posts
470 positive tweets
Chen et al. [115] Flicker images To classify the image and 133 negative tweets,
Tumblr 1179
IAPS 395,
Yang et al. [119] IAPS, Instagram visual sentiment prediction
Instagram 23308
Restaurants 3841,
Yang et al. [118] SemEval 2014 Aspect-based sentiment analysis
Laptops 3845
Positive 3094,
Zhang et al. [124] SemEval 2016 Sentiment analysis track Negative 2043
and Neutral 863 tweets
Detecting sentiment,
Schmitt et al. [94] SemEval 2017 8000-10000 tweets
humour, and truth
Joshi et al. [48] Hindi Movie Reviews Sentiment analysis in Hindi 250 Hindi Movie Reviews
Reviews of hotel
Chinese Text Sentiment
Xu et al. [116] clothes, fruit 2,50000 reviews
Analysis
digital etc
The Multimodal
Stappen et al. [99] MuSe-CAR Sentiment Analysis 15 GB Audio, Video, Text
in Car Reviews
4 emotions: angry,
Latif et al. [57] URDU-Dataset 0.072 GB Audio
happy, neutral, and sad.
6 emotions provides
single-word utterances
Duville et al. [33] MESD 0,097 GB Audio
for anger, disgust, fear
happiness, neutral, and sadness.
[1, 64, 5, 20, 72] pandemic. Therefore, there is a severe demand for a system based
on sentiment analysis that can extract sentiments about a particular product, item, or
service. It will help us to automate the user feedback or customer rating for the given
product, services, etc. This would help to improve the product and offered services
and eventually serve both the buyer and seller’s requirements.
Nowadays, consumers are getting more intelligent [98], quality-conscious, and tech-
nical savvy; therefore, they tend to seek out the reviews and ratings of online prod-
ucts and services before buying them. Many companies like Uber [11], Oyo [96],
and zomato [42] use digital transformation models to take feedback from the cus-
tomer. The online customer opinion decides the success or failure of their offered
services and products. The companies demand to extract sentiment from the online
user reviews to enhance their offered products and services. It also helps companies to
launch their new products and services in the new market for target customers. There-
fore, It is evident that sentiment analysis plays a vital role in getting the customer and
competition insights, which help companies make corrective and preventive actions
to sustain and grow their businesses in the digital era.
A Comprehensive Review on Sentiment Analysis: Tasks, Approaches and Applications 15
SA is also helpful in the share market and the Federal open market committee (FOMC
2
) statement [101, 14, 31] to extract the meaningful information for traders through
which they understand the global financial markets 3 . Some interesting trends reveal
in Figure 6, Figure 7 and Figure 8 through sentiment analysis of FOMC statements.
Fig. 6: The federal open market committee (FOMC) controls the monetary policy of
the central bank. The FOMC’s statement lexical frequency list (most popular word in
the report) of July and September 2017.
Smart homes are an emerging technology, and in the near future, the entire home will
be more secure and better connected with other home appliances. The people would
control and manage any part of the house using smart wearable devices such as apple
watch, intelligent assistant devices such as Alexa [16, 37], Google Home [91, 78],
etc. Recently there has been a lot of research going on in the Internet of Things (IoT)
and the SA. The SA also found its way in IoT, e.g., the connected home using smart
devices such as smart bulbs, smart music devices could alter its ambiance to create a
calming and comfortable environment based on the sentiment or emotion of the user.
2 https://www.federalreserve.gov/monetarypolicy/fomccalendars.htm
3 https://www.forbes.com/sites/alapshah/2017/09/22/sentiment-analysis-of-fomc-statements-reveals-a-
more-hawkish-fed/?sh=4079d5a8632e
16 Sudhanshu Kumar∗1 et al.
Fig. 7: In the list of positive and negative words in the FOMC’s statement of July
and September 2017, the words in bold font denote the negative word while the other
indicates the positive word.
Bigotry speech [40, 9, 88] is used to express repugnance towards a specifically in-
tended community, group, or person that can cause a dangerous situation to the vic-
tim. It can also be used to demean or offend particular community members or groups
on any social media. SA based detection system would help the social media compa-
nies such as Twitter [47], Instagram [71], etc., instant messaging companies such as
WhatsApp [27], Telegram and local enforcement and government to suppress hate
speech and fake news towards a specific person, sex, religion, race, country, etc.
which in turn improve their reputation and bring harmony in the community.
In modern society, suicides are rising rapidly in recent times; it is critical to find a
faster way to fine-grained emotion detection [38, 29, 84] and anxiety in online posts,
microblogging text in the form of tweets by these troubled individuals. The SA-based
detection and analysis system may help to detect such tendencies upfront and prevent
suicides.
On the flip side of excessive competition, improving the living style in a fast-moving
world, people typically face many changes from their work environment, eating habits,
etc. The body reacts to these stress changes, influencing an individual’s emotional,
A Comprehensive Review on Sentiment Analysis: Tasks, Approaches and Applications 17
Fig. 8: Most commonly used words in the FOMC statements since 2012, where n is
the occurrence of words. In this chart, the most frequent word has shown from top to
bottom; the top word in the chart has a higher occurrence than the last word.
mental, and physical health. The SA based detection system may help to detect stress
symptoms [109, 49] upfront and prevent any adverse impact due to this.
18 Sudhanshu Kumar∗1 et al.
The SA is a particularly challenging task for human behaviors and subjective senti-
ments. Below are a few of the challenges-
The same phrase might have different interpretations in different domains in which it
is being used. For Example, the word ’unpredictable’ is positive in entertainment and
theater, etc., but if the same word is used in the context of an automobile’s break, it
has a negative opinion. Still, this is challenging to identify the domain from which any
word is related correctly. Different pre-trained word embedding corpus domains such
as IMDB movie reviews corpus and customer reviews dataset classify the sentence
correctly. This challenge is still not solved completely, and researchers are continu-
ously working on this problem.
Sarcastic sentences express a negative opinion about a person or thing using positive
words in unique. Often, people use it to say the opposite of what’s true to make
someone look or feel foolish. For Example: -” Good perfume. You must marinate
in it for long”. The sentence has only positive words, but it expresses a negative
sentiment.
Discourse Structure analysis is essential for opinion mining and sentiment analysis.
For Example, A is better than B conveys the exact opposite opinion from B is better
than A. For finding SA for these kind of sentence is quite challenging.
A Comprehensive Review on Sentiment Analysis: Tasks, Approaches and Applications 19
4.5 Idioms
ML programs are designed so that they don’t understand a figure of speech. For ex-
ample, language such as ”not my cup of tea” will disrupt the algorithm because it
understands the things literally. When any user uses idioms in a comment or review,
the sentence interpretation is not correctly map by the algorithm. The situation is even
more difficult if the comment is multilingual.
User share their opinion in different languages like Hinglish which is the combina-
tion of Hindi and English. Every language has its own lemmatizer, POS tagger and
grammatical constructs so ML or Deep learning algorithm understand the context
and classify the comment in positive and negative. The real challenges is that we can
not translate multiple language into one base language. Usually in micro-blogging or
chatting, user share their feeling in multilingual.
Despite different challenges in sentiment analysis still, it is an emerging field
among customers for decision-making. Figure 9 and 10 presents users’ most popular
topic and query search from 2004 to 2020.
Fig. 9: The top 25 topics search by the user worldwide from 2004 to 2020 in the
sentiment analysis field. The most frequent topics search by the users is analysis and
opinion.
20 Sudhanshu Kumar∗1 et al.
Fig. 10: The most frequently searched query worldwide from 2004 to 2020 in the
sentiment analysis field. The sentiment, sentiment analysis, and Twitter sentiment
are the top three search queries worldwide.
In the last decade, the paradigm shifted from machine learning to deep learning tech-
niques. In-text data, the context problem is a big challenge to understand the sen-
tence’s meaning through the ML algorithm correctly. This problem solves through
pre-trained word embedding and the VADER approach even we have a smaller train-
ing dataset. However, the pre-trained word embedding corpus was trained on the
google news dataset (100 billion words) and IMDB movie dataset. It shows a good
result when the data are related to the pre-trained corpus domain; otherwise, it will
not predict the result as expected. The BERT model is the start of the art model in
NLP. It uses the bidirectional training of the input, which provides a more profound
sense of the language context. However, it is very compute-intensive and takes time
to predict the result. ML techniques are also predicted good results, depending on the
dataset and nature of data. DL techniques predict a good outcome for a large dataset.
The present study covered different domains like text, speech, image, and video
to analyze sentiment. In all domains, the start of the art algorithms and papers were
discussed in the study.
A Comprehensive Review on Sentiment Analysis: Tasks, Approaches and Applications 21
With the advancement of technology in machine learning and deep learning, the SA
plays a vital role in analyzing data available on the internet in text, image, and speech.
The SA is computationally identifying the polarity of text into a positive, negative,
and neutral review. In this survey paper, we have investigated the history of the SA
and its impact on the research community from the years 2000 to current trends. In the
last five years, most articles are related to social media such as Facebook, Instagram,
and Twitter. Most articles are related to the application area of health, restaurant,
travel, spam, and politics. We have also included the top-cited paper and discuss the
research challenges and perspectives suitable for new researchers who want to start
research in the ML, NLP, and SA fields. We also cover in detail about global fina-
cial market (FOMC), different languages like Sanskrit, Hindi, Arabic, Chinese etc
and modalities in which many authors used SA. In future work, the SA is combined
with network traffic to detect fake opinion or news, which creates a serious prob-
lem, resulting in mob violence. The method to do the SA will also improve with the
continuous advancement of the NLP and ML fields.
Conflict of interest The authors declared that they have no conflicts of interest to this
work.
References
23. Das S, Chen M (2001) Yahoo! for amazon: Extracting market sentiment from
stock message boards. In: Proceedings of the Asia Pacific finance association
annual conference (APFA), Bangkok, Thailand, vol 35, p 43
24. Das SR, Chen MY (2007) Yahoo! for amazon: Sentiment extraction from small
talk on the web. Management science 53(9):1375–1388
25. Dasgupta SS, Natarajan S, Kaipa KK, Bhattacherjee SK, Viswanathan A (2015)
Sentiment analysis of facebook data using hadoop based open source technolo-
gies. In: Proceedings of the IEEE International Conference on Data Science and
Advanced Analytics (DSAA), pp 1–3
26. Day MY, Lee CC (2016) Deep learning for financial sentiment analysis on fi-
nance news providers. In: Proceedings of the IEEE/ACM International Confer-
ence on Advances in Social Networks Analysis and Mining (ASONAM), pp
1127–1134
27. Deb K, Paul S, Das K (2020) A framework for predicting and identifying radi-
calization and civil unrest oriented threats from whatsapp group. In: Emerging
Technology in Modelling and Graphics, Springer, pp 595–606
28. Dellaert F, Polzin T, Waibel A (1996) Recognizing emotion in speech. In: Pro-
ceeding of Fourth International Conference on Spoken Language Processing.
ICSLP’96, IEEE, vol 3, pp 1970–1973
29. Desmet B, Hoste V (2013) Emotion detection in suicide notes. Expert Systems
with Applications 40(16):6351–6358
30. Devlin J, Chang MW, Lee K, Toutanova K (2018) Bert: Pre-training of
deep bidirectional transformers for language understanding. arXiv preprint
arXiv:181004805
31. Doh T, Kim S, Yang SK (2021) How you say it matters: Text analysis of fomc
statements using natural language processing. Economic Review-Federal Re-
serve Bank of Kansas City 106(1):25–40
32. Du C, Huang L (2018) Text classification research with attention-based recur-
rent neural networks. International Journal of Computers Communications &
Control 13(1):50–61
33. Duville MM, Alonso-Valerdi LM, Ibarra-Zarate DI (2021) The mexican emo-
tional speech database (mesd): elaboration and assessment based on machine
learning. In: 2021 43rd Annual International Conference of the IEEE Engineer-
ing in Medicine & Biology Society (EMBC), IEEE, pp 1644–1647
34. Elmadany A, Mubarak H, Magdy W (2018) Arsas: An arabic speech-act and
sentiment corpus of tweets. OSACT 3:20
35. Fang X, Zhan J (2015) Sentiment analysis using product review data. Journal
of Big Data 2(1):5
36. Gallege LS, Raje RR (2016) Towards selecting and recommending online soft-
ware services by evaluating external attributes. In: Proceedings of the 11th An-
nual Cyber and Information Security Research Conference (CISRC), pp 23:1-
23:4
37. Gao Y, Pan Z, Wang H, Chen G (2018) Alexa, my love: analyzing reviews of
amazon echo. In: 2018 IEEE SmartWorld, Ubiquitous Intelligence & Comput-
ing, Advanced & Trusted Computing, Scalable Computing & Communications,
Cloud & Big Data Computing, Internet of People and Smart City Innovation
24 Sudhanshu Kumar∗1 et al.
101. Tadle RC (2022) Fomc minutes sentiments and their impact on financial mar-
kets. Journal of economics and business 118:106021
102. Tan S, Wu Q (2011) A random walk algorithm for automatic construc-
tion of domain-oriented sentiment lexicon. Expert Systems with Applications
38(10):12094–12100
103. Tang D, Wei F, Yang N, Zhou M, Liu T, Qin B (2014) Learning sentiment-
specific word embedding for twitter sentiment classification. In: Proceedings
of the 52nd Annual Meeting of the Association for Computational Linguistics
(Volume 1: Long Papers), pp 1555–1565
104. Thelwall M, Buckley K, Paltoglou G, Cai D, Kappas A (2010) Sentiment
strength detection in short informal text. Journal of the American society for
information science and technology 61(12):2544–2558
105. Tian Y, Chen G, Song Y (2021) Enhancing aspect-level sentiment analysis with
word dependencies. In: Proceedings of the 16th Conference of the European
Chapter of the Association for Computational Linguistics: Main Volume, pp
3726–3739
106. Tong RM (2001) An operational system for detecting and tracking opinions in
on-line discussion. In: Working Notes of the ACM SIGIR 2001 Workshop on
Operational Text Classification, vol 1
107. Turney PD, Littman ML (2003) Measuring praise and criticism: Inference of
semantic orientation from association. ACM Transactions on Information Sys-
tems (TOIS) 21(4):315–346
108. Wang L, Niu J, Song H, Atiquzzaman M (2018) Sentirelated: A cross-domain
sentiment classification algorithm for short texts through sentiment related in-
dex. Journal of Network and Computer Applications 101:111–119
109. Wang X, Zhang C, Ji Y, Sun L, Wu L, Bao Z (2013) A depression detection
model based on sentiment analysis in micro-blog social network. In: Pacific-
Asia Conference on Knowledge Discovery and Data Mining, Springer, pp 201–
213
110. Wang Z, Yu Z, Chen L, Guo B (2014) Sentiment detection and visualization
of chinese micro-blog. In: 2014 International Conference on Data Science and
Advanced Analytics (DSAA), IEEE, pp 251–257
111. Wiebe J, Bruce R, O’Hara TP (1999) Development and use of a gold-standard
data set for subjectivity classifications. In: Proceedings of the 37th annual meet-
ing of the Association for Computational Linguistics, pp 246–253
112. Wiebe J, et al (2000) Learning subjective adjectives from corpora. Aaai/iaai
20(0):0
113. Williams J, Comanescu R, Radu O, Tian L (2018) Dnn multimodal fusion tech-
niques for predicting video sentiment. In: Proceedings of grand challenge and
workshop on human multimodal language (Challenge-HML), pp 64–72
114. Xia R, Xu F, Zong C, Li Q, Qi Y, Li T (2015) Dual sentiment analysis: Con-
sidering two sides of one review. IEEE transactions on knowledge and data
engineering 27(8):2120–2133
115. Xu C, Cetintas S, Lee KC, Li LJ (2014) Visual sentiment prediction with deep
convolutional neural networks. arXiv pp arXiv–1411
A Comprehensive Review on Sentiment Analysis: Tasks, Approaches and Applications 29