C9 Flashcards
levels of sentiment analysis
document level: sentiments score for complete text (eg. review or Tweet)
sentence level: sentiment score per sentence (review may address multiple aspects)
entity and aspect level: relate the sentiment to features of a product, event or entity
sentiment classes
negative, positive, neutral
alternatives: objective vs. subjective, joy/anger/fear/etc., stance (pro/con/neutral)
ordinal scales
ordinal regression
learn a model to predict class labels on an ordinal scale
- variant of regression for ordinal variables
- a problem between regression and classification (“ordinal classification”)
P(y ≤ j | theta_j, w, X) = 1 / (1 + e^(-(theta_j - Xw))
y = target variable
theta_j = threshold for class j
X = input instances
w = weights to be learned
aspect-based sentiment analysis
find quintuple (E, A, S, H, C)
E = opinion target (entity, event or topic) (given by metadata in reviews, or extract from text)
A = aspect or feature of E (aspect categorization needed (can be challenging), aspects are domain and product dependent)
S = sentiment/opinion content (sentiment score of A)
H = opinion holder (the author or extract from (news) text)
C = context; time and location of the expression (data/location stamp in Tweets or reviews, else extract from text)
why does it help to have a product database?
- To know which products exist (someone might mention a different product in the review)
- To know which aspects a given product type has (a drill does not have cleanliness as relevant aspect)
- This facilitates aspect extraction (know what to look for in the text)
challenges of sentiment analysis
- sentiment words do not always express sentiment (“can you tell me which camera is good?”, “If I see a good camera, I will buy it”)
- Sentiment words are ambiguous, context- and domain dependent.
- Sarcasm (“great headphones if you enjoy the noises of other people”)
- objective sentences that express sentiment (“the washing machine uses a lot of water.”)
evaluation of sentiment analysis
discrete labels => precision and recall, average F-score only on positive and negative labels
regression => RMSE
repeatability
Same team, same experimental setup: can you find your own result again with your own hardware, code, and data?
reproducibility
Different team, same experimental setup: same artifact (code, data, experimental set-up) as the original researchers.
replicability
Different team, different experimental setup: someone else can find the same results (e.g. “Transformers are better for this problem than SVM!”) with their own code