Lecture 1 Flashcards

Question

Anscombe's quartet

Answer 1

Four datasets can share nearly identical summary statistics (mean, variance, correlation coefficient and best-fit line), so one might expect them to look the same when plotted. This is not the case: the effect of curvature and outliers can be huge. Statistical descriptors are incomplete descriptors of the underlying data!
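The point of this card can be checked numerically. The sketch below is not part of the original flashcards: it assumes the standard published values of Anscombe's quartet and uses hypothetical helper names (`mean`, `summarize`, `summaries`). It computes the same summary statistics for each of the four datasets using only the standard library.

```python
from math import sqrt

# Anscombe's quartet (values as published by Anscombe, 1973).
# Datasets I-III share the same x values; dataset IV has its own.
x123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
x4 = [8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8]
quartet = {
    "I": (x123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    "II": (x123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    "III": (x123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    "IV": (x4, [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def mean(values):
    return sum(values) / len(values)


def summarize(x, y):
    """Return mean, sample variance, Pearson r and least-squares line for (x, y)."""
    n = len(x)
    mx, my = mean(x), mean(y)
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    slope = sxy / sxx
    return {
        "mean_x": mx,
        "mean_y": my,
        "var_x": sxx / (n - 1),
        "var_y": syy / (n - 1),
        "r": sxy / sqrt(sxx * syy),
        "slope": slope,
        "intercept": my - slope * mx,
    }


summaries = {name: summarize(x, y) for name, (x, y) in quartet.items()}
for name, s in summaries.items():
    print(name, {key: round(value, 2) for key, value in s.items()})
```

The four printed rows agree to about two decimal places, yet a scatter plot of each dataset shows a line, a curve, a line with one outlier, and a vertical cluster with one extreme point.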

Answer 2

- describe the relationship between random variables
- predict the value of one variable based on another variable

Answer 3

outcome, response, target (on y-axis)

Answer 4

input, predictors, features (on x-axis)

Answer 5

A parametrized function y = F(x): x --> f() --> F(x). You put x into a function and get the function's value at x as the output. Finding this function is the learning task; it can be a predictor or a classifier.

Answer 6

The model learns from the input data given to it and then uses this learning to classify new observations. Each instance (point) has a class label and can be represented by a feature vector, e.g. [X1, X2] (when classes are defined by 2 features). The classes are divided by a decision boundary.

Answer 7

A decision boundary is nothing more than the function of x (so f(x), the learning task, the classifier, the model). This function, this classifier, is trained on the dataset (labelled data points) to 'draw' a division between classes. Training is an ITERATIVE PROCESS. The decision boundary is considered to be a model of the separation between classes.
NOTE There is a difference between the terms algorithm and model.
Algorithm = a mathematical technique or equation (that is, a framework) with parameters.
Model = the equation that is formed by using data to find the parameters in the equation of an algorithm.
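To make the algorithm/model distinction concrete, here is a minimal sketch that uses a perceptron as the algorithm. The perceptron is chosen only for illustration and is not named in the flashcards; the data points and the names `train_perceptron` and `predict` are hypothetical. The algorithm is the update rule with unknown parameters w and b; the model is the specific boundary w1*x1 + w2*x2 + b = 0 obtained after iterating over the labelled points.

```python
# Labelled training data: feature vectors [x1, x2] with class labels -1 / +1.
# These points are made up for illustration and are linearly separable.
points = [(1, 1), (2, 1), (1, 2), (4, 4), (5, 4), (4, 5)]
labels = [-1, -1, -1, 1, 1, 1]


def train_perceptron(points, labels, max_epochs=1000):
    """Algorithm: iteratively adjust (w, b) until every point is classified correctly."""
    w = [0.0, 0.0]
    b = 0.0
    for _ in range(max_epochs):
        mistakes = 0
        for (x1, x2), y in zip(points, labels):
            score = w[0] * x1 + w[1] * x2 + b
            if y * score <= 0:  # misclassified (or exactly on the boundary)
                w[0] += y * x1
                w[1] += y * x2
                b += y
                mistakes += 1
        if mistakes == 0:  # a full pass without errors: training has converged
            break
    return w, b


def predict(model, point):
    """Model: the learned parameters define which side of the boundary a point is on."""
    w, b = model
    return 1 if w[0] * point[0] + w[1] * point[1] + b > 0 else -1


model = train_perceptron(points, labels)
print("learned parameters (w, b):", model)
print("prediction for new observation (5, 5):", predict(model, (5, 5)))
```

Running the same algorithm on different data would give different parameters, and therefore a different model.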

Answer 8

linear (straight line) or non-linear (wiggly line); this depends on the number of parameters and polynomial terms

Answer 9

- feature selection (select relevant features)
- feature extraction (define relevant features), e.g. PCA, FA, for:
  - Image processing: edge detection
  - From pixels to a reduced set of features
https://www.youtube.com/watch?v=LDhqqxOVqV0

Answer 10

- reduce complexity and make interpretation easier
- reduce demand on resources (computation/RAM)
- reduce the 'curse of dimensionality'
- reduce the chance of overfitting

Answer 11

Identifying rules that describe specific patterns within the data. For example, supermarkets used market-basket analysis to identify items that were often purchased together—for instance, a store featuring a fish sale would also stock up on tartar sauce.
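A small sketch of the idea behind market-basket analysis, assuming a handful of made-up shopping baskets (the baskets and the names `support` and `confidence` are hypothetical, not taken from the lecture). It counts how often items and item pairs occur together and derives two common rule measures.

```python
from collections import Counter
from itertools import combinations

# Hypothetical transactions: each set is one customer's basket.
baskets = [
    {"fish", "tartar sauce", "lemon"},
    {"fish", "tartar sauce"},
    {"bread", "milk"},
    {"fish", "bread"},
    {"milk", "tartar sauce", "fish"},
]

item_counts = Counter()
pair_counts = Counter()
for basket in baskets:
    item_counts.update(basket)
    # Sort so each unordered pair is always counted under the same key.
    pair_counts.update(combinations(sorted(basket), 2))


def support(a, b):
    """Fraction of all baskets that contain both a and b."""
    return pair_counts[tuple(sorted((a, b)))] / len(baskets)


def confidence(a, b):
    """Among baskets containing a, the fraction that also contain b (rule a -> b)."""
    return pair_counts[tuple(sorted((a, b)))] / item_counts[a]


print("support(fish, tartar sauce)   =", support("fish", "tartar sauce"))
print("confidence(fish -> tartar sauce) =", confidence("fish", "tartar sauce"))
```

With these baskets, fish and tartar sauce appear together in 3 of 5 baskets, and 3 of the 4 baskets with fish also contain tartar sauce.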

Answer 12

Covariance
- Measures how X and Y change together.
- Tells us the direction in which two variables vary with each other.
- Can be positive (large values of one variable are associated with large values of the other), negative (large values of one variable are associated with small values of the other) or zero (no trend).
Correlation
- Gives both the direction and the magnitude of how X and Y vary with each other.
- Serves as a scaled version of covariance: dividing by the standard deviations of X and Y makes it unitless, a kind of normalisation (same idea as variance and standard deviation).
- Three categories: positive, negative, or zero.
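The difference between the two measures can be seen in a short sketch. The data and the function names `covariance` and `correlation` are made up for illustration; sample (n - 1) formulas are assumed. Rescaling X changes the covariance but leaves the correlation untouched.

```python
from math import sqrt


def covariance(x, y):
    """Sample covariance: average product of deviations from the means (n - 1 denominator)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    return sum((a - mx) * (b - my) for a, b in zip(x, y)) / (n - 1)


def correlation(x, y):
    """Pearson correlation: covariance divided by both standard deviations (unitless)."""
    return covariance(x, y) / sqrt(covariance(x, x) * covariance(y, y))


x = [1, 2, 3, 4, 5]
y_up = [3, 5, 7, 9, 11]      # increases with x: positive covariance
y_down = [11, 9, 7, 5, 3]    # decreases with x: negative covariance
x_scaled = [100 * a for a in x]  # same data in a different unit (e.g. m -> cm)

print("cov(x, y_up)         =", covariance(x, y_up))
print("cov(x, y_down)       =", covariance(x, y_down))
print("cov(x_scaled, y_up)  =", covariance(x_scaled, y_up))
print("corr(x, y_up)        =", correlation(x, y_up))
print("corr(x_scaled, y_up) =", correlation(x_scaled, y_up))
```

The sign of the covariance gives the direction, but its size depends on the units; the correlation always lies between -1 and 1.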

Answer 13

Process of using computer science (algorithms and database methods) and statistics to find applicable, usable knowledge (like patterns, trends and relationships) in raw (big) data. We apply an algorithm that "learns" something about the data. These algorithms are machine learning algorithms.
NOTE There is a difference between the terms algorithm and model.
Algorithm = a mathematical technique or equation (that is, a framework) with parameters.
Model = the equation that is formed by using data to find the parameters in the equation of an algorithm.

(37 cards)