C9 Flashcards

Question 1

Q

levels of sentiment analysis

Answer

A

document level: one sentiment score for the complete text (e.g. a review or Tweet)

sentence level: sentiment score per sentence (review may address multiple aspects)

entity and aspect level: relate the sentiment to features of a product, event or entity
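
A minimal sketch of the difference between document-level and sentence-level scoring. The tiny lexicon and the function names are assumptions made for illustration only; a real system would use a trained model or a much larger lexicon.

```python
import re

# Hypothetical toy lexicon (not from the course material).
LEXICON = {"great": 1, "good": 1, "love": 1, "bad": -1, "poor": -1, "terrible": -1}

def score(text):
    """Sum the lexicon scores of all words in `text`."""
    return sum(LEXICON.get(word, 0) for word in re.findall(r"[a-z']+", text.lower()))

def document_level(text):
    """One score for the complete text."""
    return score(text)

def sentence_level(text):
    """One score per sentence, split naively on ., ! and ?."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    return [(s, score(s)) for s in sentences]

review = "The battery is great. The screen is terrible."
print(document_level(review))   # the two opinions cancel out
print(sentence_level(review))   # each sentence keeps its own score
```

The document-level score hides that the review is positive about one aspect and negative about another, which is what the sentence level and the aspect level are meant to recover.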

Question 2

Q

sentiment classes

Answer

A

negative, positive, neutral

alternatives: objective vs. subjective, joy/anger/fear/etc., stance (pro/con/neutral)

ordinal scales

Question 3

Q

ordinal regression

Answer

A

learn a model to predict class labels on an ordinal scale

variant of regression for ordinal variables
a problem between regression and classification (“ordinal classification”)

P(y ≤ j | theta_j, w, X) = 1 / (1 + e^(-(theta_j - Xw)))
y = target variable
theta_j = threshold for class j
X = input instances
w = weights to be learned
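
A sketch of how the formula could be evaluated for a single instance, assuming the weights and thresholds have already been learned. The numeric values below are made up. The probability of an individual class follows from the difference between adjacent cumulative probabilities, which requires the thresholds to be increasing.

```python
import math

def cumulative_probs(x, w, thetas):
    """P(y <= j) for each threshold theta_j, for one instance x."""
    xw = sum(xi * wi for xi, wi in zip(x, w))
    return [1.0 / (1.0 + math.exp(-(theta - xw))) for theta in thetas]

def class_probs(x, w, thetas):
    """P(y = j) as the difference of adjacent cumulative probabilities."""
    cum = cumulative_probs(x, w, thetas) + [1.0]  # P(y <= highest class) = 1
    return [cum[0]] + [cum[j] - cum[j - 1] for j in range(1, len(cum))]

# Hypothetical learned parameters: 3 thresholds give 4 ordered classes.
x = [1.0, 2.0]
w = [0.5, 0.5]
thetas = [0.0, 1.0, 2.0]

probs = class_probs(x, w, thetas)
predicted = max(range(len(probs)), key=probs.__getitem__)
print(probs, predicted)
```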

Question 4

Q

aspect-based sentiment analysis

Answer

A

find quintuple (E, A, S, H, C)
E = opinion target (entity, event or topic); given by metadata in reviews, or extracted from the text
A = aspect or feature of E; requires aspect categorization, which can be challenging because aspects are domain and product dependent
S = sentiment/opinion content (sentiment score of A)
H = opinion holder (the author, or extracted from (news) text)
C = context: time and location of the expression (date/location stamp in Tweets or reviews, else extracted from the text)
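
One possible way to represent the quintuple in code, as a sketch. The class name, field names and example values are hypothetical and only serve to show how extracted opinions could be stored and aggregated per aspect.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass(frozen=True)
class Opinion:
    entity: str                    # E: opinion target
    aspect: str                    # A: aspect or feature of E
    sentiment: float               # S: sentiment score of A
    holder: str                    # H: opinion holder
    context: Optional[str] = None  # C: time and/or location, if known

# Made-up example data.
opinions = [
    Opinion("Camera X", "battery", 0.8, "reviewer_1", "2024-01-05"),
    Opinion("Camera X", "price", -0.5, "reviewer_1", "2024-01-05"),
    Opinion("Camera X", "battery", 0.4, "reviewer_2"),
]

def mean_sentiment_per_aspect(opinions):
    """Average the sentiment scores of all opinions that share an aspect."""
    scores = {}
    for op in opinions:
        scores.setdefault(op.aspect, []).append(op.sentiment)
    return {aspect: sum(v) / len(v) for aspect, v in scores.items()}

print(mean_sentiment_per_aspect(opinions))
```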

Question 5

Q

why does it help to have a product database?

Answer

A

To know which products exist (someone might mention a different product in the review)
To know which aspects a given product type has (a drill does not have cleanliness as a relevant aspect)
This facilitates aspect extraction (know what to look for in the text)
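
A sketch of how a product database could narrow down aspect extraction, assuming a simple mapping from product type to known aspects. The database contents and the function name are invented for illustration, and exact word matching stands in for a real extraction method.

```python
import re

# Hypothetical product database: product type -> relevant aspects.
PRODUCT_DB = {
    "drill": {"power", "battery", "weight"},
    "hotel": {"cleanliness", "location", "staff"},
}

def candidate_aspects(product_type, text):
    """Return the known aspects of this product type that occur in the text."""
    known = PRODUCT_DB.get(product_type, set())
    words = set(re.findall(r"[a-z]+", text.lower()))
    return sorted(known & words)

text = "Good power, but the battery dies fast. Cleanliness is fine."
print(candidate_aspects("drill", text))
```

Because cleanliness is not listed as an aspect of a drill, it is ignored for this product type even though the word occurs in the text.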

Question 6

Q

challenges of sentiment analysis

Answer

A

sentiment words do not always express sentiment (“can you tell me which camera is good?”, “If I see a good camera, I will buy it”)
Sentiment words are ambiguous, and context- and domain-dependent.
Sarcasm (“great headphones if you enjoy the noises of other people”)
objective sentences that express sentiment (“the washing machine uses a lot of water.”)

Question 7

Q

evaluation of sentiment analysis

Answer

A

discrete labels => precision and recall per class; F-score averaged over the positive and negative classes only

regression => RMSE
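
A sketch of both evaluation settings in plain Python. The function names and the example labels are assumptions; the F-score here is F1, averaged over the positive and negative classes only, so the neutral class does not contribute its own score.

```python
import math

def precision_recall_f1(gold, pred, label):
    """Precision, recall and F1 for a single class label."""
    tp = sum(g == label and p == label for g, p in zip(gold, pred))
    fp = sum(g != label and p == label for g, p in zip(gold, pred))
    fn = sum(g == label and p != label for g, p in zip(gold, pred))
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1

def avg_f1_pos_neg(gold, pred):
    """Mean of the F1 scores of the positive and negative classes."""
    f1_pos = precision_recall_f1(gold, pred, "positive")[2]
    f1_neg = precision_recall_f1(gold, pred, "negative")[2]
    return (f1_pos + f1_neg) / 2

def rmse(gold, pred):
    """Root mean squared error between gold and predicted scores."""
    return math.sqrt(sum((g - p) ** 2 for g, p in zip(gold, pred)) / len(gold))

# Made-up example labels and scores.
gold = ["positive", "negative", "neutral", "positive"]
pred = ["positive", "neutral", "neutral", "negative"]
print(avg_f1_pos_neg(gold, pred))
print(rmse([3.0, 5.0], [1.0, 5.0]))
```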

Question 8

Q

repeatability

Answer

A

Same team, same experimental setup: can you find your own result again with your own hardware, code, and data?
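
One common precondition for repeatability is fixing the sources of randomness. The sketch below uses a hypothetical experiment (a shuffled data split) to show that a fixed seed gives the same outcome on every run; a real setup would also need pinned library versions, data and hardware settings.

```python
import random

def run_experiment(seed):
    """Hypothetical experiment: draw a 'test split' from a shuffled dataset."""
    rng = random.Random(seed)   # local generator, independent of global state
    data = list(range(10))
    rng.shuffle(data)
    return data[:3]

first = run_experiment(seed=42)
second = run_experiment(seed=42)
print(first == second)  # same seed, same code, same data: same split
```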

Question 9

Q

reproducibility

Answer

A

Different team, same experimental setup: another team can obtain the same result using the original researchers' artifacts (code, data, experimental setup).

Question 10

Q

replicability

Answer

A

Different team, different experimental setup: someone else can find the same results (e.g. “Transformers are better for this problem than SVM!”) with their own code

(10 cards)