B04 Sentiment Analysis Flashcards
What is Sentiment Analysis?
Sentiment analysis (or opinion mining) is the process of extracting an author’s emotional intent from text.
The task of sentiment analysis is not only about finding
the opinions about the whole entity but also the opinions about individual attributes of the entity and summarizing them.
Sentiment Analysis
Individual Attributes
aspects
Sentiment Analysis
The person making the opinion
Opinion holder
Sentiment Analysis
The nature of the sentiment expressed
orientation or polarity
Sentiment Analysis
The entity or aspect that the opinion is expressed about
Opinion target
Some attributes of sentiment analysis
1. Does a piece of text represent a positive or a negative sentiment? 2. What are the entities being discussed about, and are they being discussed about in a positive or negative way? 3. What attributes of the entity are discussed, and what are the sentiments expressed about them? 4. What do people think about this candidate or issue?
Challenges of Sentiment Analysis
- Cultural and demographic differences between authors.
- Discerning between feature-specific sentiment.
- Quantifying the hundreds of emotional states which are part of the human condition.
Plutchik emotion
framework classifies 8
evolutionary emotions
Anger, Fear, Anticipation, Surprise, Joy, Sadness, Trust, Disgust
Document Polarity
- Instead of trying to predict emotional states, an easier
approach is to simply state whether a document is positive or negative. - This is referred to as the polarity of a document.
- The approaches to calculating polarity vary and can either be fairly straightforward or rather sophisticated.
Opinion Words
A straightforward approach to calculating polarity involves the use of certain words that are associated with a particular emotional state.
Some examples of opinion words
Opinion words are often adjectives and adverbs (e.g. “good”, “bad”, “excellent”, etc.), although nouns (e.g., “trash”) or verbs (e.g., “annoy”) are also sometimes used.
Opinion or Sentiment Lexicon
- A collection of opinion words along with their polarity form what is known as an opinion or sentiment lexicon.
- Sentiment lexicons are created either through crowd
sourcing or by the labor of an author, and then validated by crowd sourcing or research. - We can calculate the polarity of a document by simply
adding up positive words and subtracting negative words.
Bing
Categorizes words in
a binary fashion into
positive and negative
categories.
AFINN
Assigns words with a score that runs between -5 and 5, with negative scores indicating negative sentiment and positive scores indicating positive sentiment.
NRC
Categorizes words into categories of positive, negative, anger, anticipation, disgust, fear, joy, sadness, surprise, and trust.