B04 Sentiment Analysis Flashcards by Nicholas Kouretas

What is Sentiment Analysis?

Sentiment analysis (or opinion mining) is the process of extracting an author’s emotional intent from text.

The task of sentiment analysis is not only to find opinions about the entity as a whole, but also to find opinions about individual attributes of the entity and to summarize them.

Sentiment Analysis

Individual Attributes

aspects

Sentiment Analysis

The person making the opinion

Opinion holder

Sentiment Analysis

The nature of the sentiment expressed

orientation or polarity

Sentiment Analysis

The entity or aspect that the opinion is expressed about

Opinion target
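
The four terms on the cards above (aspect, opinion holder, polarity, opinion target) can be collected into a single record. The following is only a minimal Python sketch: the class name `Opinion` and its field names are hypothetical and chosen for illustration, not taken from any library.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Opinion:
    """One opinion found in a text (illustrative structure, not a standard API)."""
    holder: str            # opinion holder: the person making the opinion
    target: str            # opinion target: the entity the opinion is about
    aspect: Optional[str]  # individual attribute of the entity, None for the whole entity
    polarity: str          # orientation of the sentiment: "positive" or "negative"


# "Maria says the phone's battery is excellent."
example = Opinion(holder="Maria", target="phone", aspect="battery", polarity="positive")
print(example)
```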

Some attributes of sentiment analysis

1. Does a piece of text express a positive or a negative sentiment?
2. What entities are being discussed, and are they discussed in a positive or negative way?
3. What attributes of the entity are discussed, and what sentiments are expressed about them?
4. What do people think about this candidate or issue?

Challenges of Sentiment Analysis

Cultural and demographic differences between authors.
Distinguishing between sentiments about specific features.
Quantifying the hundreds of emotional states that are part of the human condition.

Plutchik’s emotion framework classifies 8 evolutionary emotions

Anger, Fear, Anticipation, Surprise, Joy, Sadness, Trust, Disgust

Document Polarity

Instead of trying to predict emotional states, an easier approach is to simply state whether a document is positive or negative.
This is referred to as the polarity of a document.
The approaches to calculating polarity vary and can either be fairly straightforward or rather sophisticated.

Opinion Words

A straightforward approach to calculating polarity involves the use of certain words that are associated with a particular emotional state.

Some examples of opinion words

Opinion words are often adjectives and adverbs (e.g., “good”, “bad”, “excellent”), although nouns (e.g., “trash”) or verbs (e.g., “annoy”) are also sometimes used.

Opinion or Sentiment Lexicon

A collection of opinion words, along with their polarity, forms what is known as an opinion or sentiment lexicon.
Sentiment lexicons are created either through crowdsourcing or by the labor of an author, and then validated by crowdsourcing or research.
We can calculate the polarity of a document by simply counting the positive words and subtracting the count of negative words.
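
The counting rule above can be sketched in a few lines of Python. This is an illustration under stated assumptions: `TOY_LEXICON` is a tiny hand-made word list (not a real published lexicon), and the function name `polarity` is hypothetical.

```python
import re

# Tiny hand-made lexicon for illustration only (not a real sentiment lexicon).
TOY_LEXICON = {
    "good": "positive",
    "excellent": "positive",
    "love": "positive",
    "bad": "negative",
    "trash": "negative",
    "annoy": "negative",
}


def polarity(document: str) -> int:
    """Return (number of positive words) minus (number of negative words)."""
    words = re.findall(r"[a-z']+", document.lower())
    score = 0
    for word in words:
        label = TOY_LEXICON.get(word)
        if label == "positive":
            score += 1
        elif label == "negative":
            score -= 1
    return score


print(polarity("The food was excellent and the service was good."))  # 2
print(polarity("Bad plot, bad acting, but a good soundtrack."))      # -1
```

Words missing from the lexicon contribute nothing, so the result depends heavily on how well the lexicon covers the text.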

Bing

Categorizes words in a binary fashion into positive and negative categories.

AFINN

Assigns each word a score between -5 and 5, with negative scores indicating negative sentiment and positive scores indicating positive sentiment.
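
With graded scores, a document's polarity becomes a sum of word scores rather than a count. A minimal Python sketch, assuming a made-up score table: the values in `TOY_SCORES` are invented for illustration and are not the real AFINN values, and `graded_score` is a hypothetical name.

```python
import re

# Hypothetical scores in the range -5..5 (values invented for illustration).
TOY_SCORES = {"excellent": 3, "good": 2, "bad": -2, "terrible": -3}


def graded_score(document: str) -> int:
    """Sum the scores of all words found in the table; unknown words score 0."""
    words = re.findall(r"[a-z']+", document.lower())
    return sum(TOY_SCORES.get(word, 0) for word in words)


# One mildly positive and one strongly negative word: the sum is negative,
# whereas a binary positive/negative count would come out at zero.
print(graded_score("The start was good but the ending was terrible."))  # -1
```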

NRC

Categorizes words into categories of positive, negative, anger, anticipation, disgust, fear, joy, sadness, surprise, and trust.
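
Because a word can fall into several of these categories at once, a category-based lexicon yields a tally per category instead of a single score. A rough Python sketch, assuming a hypothetical three-word table `TOY_CATEGORIES` (the entries are invented, not taken from the NRC lexicon):

```python
import re
from collections import Counter

# Hypothetical category entries: one word may carry several categories.
TOY_CATEGORIES = {
    "love": ["positive", "joy"],
    "afraid": ["negative", "fear"],
    "surprise": ["surprise"],
}


def category_counts(document: str) -> Counter:
    """Count how often each category is triggered by the words of a document."""
    words = re.findall(r"[a-z']+", document.lower())
    counts = Counter()
    for word in words:
        counts.update(TOY_CATEGORIES.get(word, []))
    return counts


print(category_counts("I love this gift, what a surprise!"))
```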

Loughran

Categorizes words into categories of negative, litigious, positive, uncertainty, constraining and superfluous.

What does the get_sentiments() function do?

Returns a specific sentiment lexicon in tidy format.
Values for the lexicon argument are “afinn”, “bing”, “nrc”, or “loughran”.
Besides calling get_sentiments(), we can also refer to the sentiments dataset directly.

Note with sentiments that:

Not every English word is represented in the lexicons.
The lexicons do not take qualifiers into account, for example “no good” or “not true”.
The size of the text analyzed can have an impact on the results.

Sentence Level Sentiment Analysis

Takes valence shifters into consideration in an effort to do more accurate sentiment analysis.

What are valence shifters?

Valence shifters are words that have an impact on the overall polarity of a message.

The 4 categories of valence shifters:

negators
amplifiers
de-amplifiers
adversative conjunctions

Negators

Flip the sign of the polarized word.

“I do not love apple pie.”

Amplifier

Increases or intensifies the impact of a polarized word.

“I really love apple pie.”

De-amplifier

Reduces the impact of a polarized word.

“I hardly like apple pie.”

Adversative Conjunction

Overrules the previous clause containing a polarized word.

“I would love to bake apple pie but it’s not worth it.”
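
The four kinds of valence shifter can be tried out on the apple pie examples with a deliberately simplified Python sketch. Everything here is an assumption made for illustration: the word lists, the weights (1.5 and 0.5), the rule that only the word directly before a polarized word can shift it, and the choice to discard the clause before an adversative conjunction. It is not how the sentimentr package computes its scores.

```python
import re

# All word lists and weights below are invented for illustration.
POLARITY = {"love": 1.0, "like": 0.5, "worth": 0.5, "hate": -1.0}
NEGATORS = {"not", "never", "no"}
AMPLIFIERS = {"really", "very"}
DEAMPLIFIERS = {"hardly", "barely"}
ADVERSATIVES = {"but", "however"}


def score_clause(words):
    """Score one clause; the word just before a polarized word may shift it."""
    total = 0.0
    for i, word in enumerate(words):
        if word not in POLARITY:
            continue
        value = POLARITY[word]
        previous = words[i - 1] if i > 0 else ""
        if previous in NEGATORS:
            value = -value   # negator: flip the sign
        elif previous in AMPLIFIERS:
            value *= 1.5     # amplifier: intensify
        elif previous in DEAMPLIFIERS:
            value *= 0.5     # de-amplifier: reduce
        total += value
    return total


def score_sentence(sentence: str) -> float:
    words = re.findall(r"[a-z']+", sentence.lower())
    # Adversative conjunction: keep only the clause after the last one.
    for i in range(len(words) - 1, -1, -1):
        if words[i] in ADVERSATIVES:
            words = words[i + 1:]
            break
    return score_clause(words)


print(score_sentence("I do not love apple pie."))   # -1.0
print(score_sentence("I really love apple pie."))   # 1.5
print(score_sentence("I hardly like apple pie."))   # 0.25
print(score_sentence("I would love to bake apple pie but it's not worth it."))  # -0.5
```

A plain word-count approach would score the first and last sentences as positive, which is the gap that valence shifters are meant to close.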

What does the sentimentr package do?

Designed to quickly calculate text polarity sentiment at the sentence level.
Optionally allows for aggregation by rows or grouping variable(s).

What does the get_sentences() function do?

Performs sentence boundary disambiguation.
Returns a list of vectors of sentences.
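
To make the shape of that result concrete, here is a naive Python sketch that returns one list of sentences per input text. It is a stand-in under stated assumptions: the function name `split_sentences` is hypothetical, and the splitting rule (break after ".", "!" or "?" followed by whitespace) does no real disambiguation, so abbreviations such as "Dr." would be split incorrectly.

```python
import re


def split_sentences(texts):
    """Return one list of sentences per input text (naive punctuation split)."""
    result = []
    for text in texts:
        # Split at whitespace that follows sentence-ending punctuation.
        parts = re.split(r"(?<=[.!?])\s+", text.strip())
        result.append([part for part in parts if part])
    return result


print(split_sentences(["I love pie. Do you? Yes!", "One sentence only."]))
```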

What does the sentiment() function do?

Approximates the sentiment (polarity) of text by sentence.
Several polarity and valence shifter dictionaries can be used.
Returns data table of element_id, sentence_id, word_count, and sentiment.

What does the sentiment_by() function do?

Approximates the sentiment (polarity) of text by group.
Returns data table of element_id, word_count, sd and ave_sentiment.
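
Grouped aggregation of sentence-level scores can be sketched in Python as follows. The rows and numbers are made up, `aggregate_by_element` is a hypothetical name, and using a plain mean and sample standard deviation is an assumption; the package's own averaging may differ.

```python
from collections import defaultdict
from statistics import mean, stdev

# Sentence-level rows; the numbers are invented for illustration.
sentence_rows = [
    {"element_id": 1, "sentence_id": 1, "word_count": 5, "sentiment": 0.5},
    {"element_id": 1, "sentence_id": 2, "word_count": 3, "sentiment": -0.5},
    {"element_id": 2, "sentence_id": 1, "word_count": 4, "sentiment": 0.25},
]


def aggregate_by_element(rows):
    """Collapse sentence rows into one summary row per element_id."""
    groups = defaultdict(list)
    for row in rows:
        groups[row["element_id"]].append(row)
    summary = []
    for element_id in sorted(groups):
        scores = [row["sentiment"] for row in groups[element_id]]
        summary.append({
            "element_id": element_id,
            "word_count": sum(row["word_count"] for row in groups[element_id]),
            # A standard deviation needs at least two sentences.
            "sd": stdev(scores) if len(scores) > 1 else None,
            "ave_sentiment": mean(scores),
        })
    return summary


for row in aggregate_by_element(sentence_rows):
    print(row)
```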

B04 Sentiment Analysis Flashcards

(29 cards)