Semester 1 Flashcards
Accuracy?
(TP+TN)/(TP+TN+FP+FN)
Bayes Theorem
P(A/B)=P(A)*P(B/A)/P(B)
Posterior=Prior*Likelihood/Evidence
Bayesian vs frequentist statitics
Bayesian asumes that the data are true, and that the parameters don’t have a single value, but a range of plausible values
Bernoulli distribution?
Single trial, only 2 possible outcomes.
Convolutional Network?
Decision Tree?
DSFI?
Error due to bias?
Model doesn’t learn the patterns of the training data. It oversimplifies the patterns or learns the wrong patterns
Error due to variance?
Model pays too much attention to patterns in the training data and “memorizes” the patterns. The model isn’t flexible enough to generalize into new data
Gamma distribution?
Continuous probability distribution that models right-skewed data
Gradient Boosting Machine
How to measure dependence between variables if the variables are continuous? Discrete?
- Pearson.
- Spearman.
KNN?
K Nearest Neighbours. Classification or Regression. Finds the K nearest points to a given new data point and uses the majority or average vote to predict the target variable.
Likelihood estimation method?
What is the likelihood of the variable having a certain distribution, given the observed data?
Precision?
TP/(TP+FP)