Lecture 1 - Introduction Flashcards

1
Q

What is machine learning about

A

Machine learning is about using the right features to build the right models that achieve the right tasks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are Tasks

A

Problems that can be solved with machine learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are Predictive Tasks

A

Predicting a target variable from a number of features

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are descriptive tasks

A

exploiting the underlying structure of the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are 3 Predictive Tasks

A
  • Classification
  • Regression
  • Predictive clustering
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are 3 Descriptive Tasks

A
  • Descriptive clustering
  • Association rule mining
  • Subgroup Discovery

Descriptive tasks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the difference between predictive and descriptive tasks

A

Model output of predictive models involves a target variable, while the model output of the descriptive models does not.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Predictive Classification

What is Classification

Also give 2 examples

A

Classification tasks predict categorical target variable from a set of features.
* image classification
* weather type prediction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Predictive regression

What is regression

Give 2 examples

A

Regression tasks predict a numerical target variable from a set of features
* stock price forecasting
* weather temperature forecast

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Predictive clustering

What is predictive clustering

give 1 example

A

Predictive clustering predicts with the intention to assign class labels (predicting a target)
* fraud detection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Descriptive Clustering

What is descriptive clustering

give 2 examples

A

The clusters are representing different groups formed in data without the intention of predicting a target.
* grouping plant data
* pattern mining

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Association rule mining

What is association rule mining

Give 2 examples

A

A rule-based task for discovering interesting relations between variables
* market basket anaylsis
* online shopping

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Sub-group discovery

What is subgroup discovery

give 2 examples

A

Technique that discovers interesting associations among different variables, with respect to a property of interest
* detection of risk groups with disease
* finding patterns in traffic accidents

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Supervised vs Unsupervised

What is supervised learning

A

In supervised learning tasks, we provide a traning set of examples: instances, labelled with the true target value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Supervised vs Unsupervised

What is Unsupervised learning

A

In unsupervised the data is unlabelled

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Name two Supervised and Predictive models

A
  1. Classification
  2. Regression
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Name one Supervised learning descriptive model

A
  1. Subgroup discovery
18
Q

Name one unsupervised learning predictive model

A
  1. Predictive clustering
19
Q

Name two unsupervised learning descriptive models

A
  1. Descriptive clustering
  2. association rule discovery
20
Q

What are Models?

A

Models are what is being learned from the data, in order to solve a given task

21
Q

How does a model of regression look like?

Equasion with Yi, Xi, ei

A

Yi = f(Xi+B)+ei
where Yi, Xi, ei are the target, features, and noise of specific instance i, and B and f are model paramters and model function

22
Q

What are the two ways machine learning models can be distinguished

A
  1. Main intuition
  2. Modus operandi (mode of operations)
23
Q

Main intuition

What are geometric models

A

using geometrical concepts. shit like linear tranformations, distance metrics, seperating hyperplanes

24
Q

Main intuition

what are Probabilistic models

A

aim for reducing uncertanty using probability distributions

25
Q

Main intuition

what are Logical Models

A

defined in terms of easily interpretable logical expressions

26
Q

Second Categorization (modus operandi)

What are gouping models

A

dividing the instance space into segments
in each segment a very simple model is learnt

27
Q

Modus operandi

What are Grading models

A

learning a single, global model over the instance space

28
Q

Geometric models

Instance Space?

A

the set of all possible instances, whether they are present in the data set or not

29
Q

Geometric models

Distances?

A

distance between two points
* Euclidean distance for two points. (could work in multiple dimentions)

30
Q

Geometric models

What are Hyper-Planes

A

a decision boundary that divides the input space into two or more regions, each corresponding to a different class or output label

31
Q

Probabilitic models

What is the bayes rule

A

P(Y|X)=P(X|Y)P(Y)
P(X)

32
Q

probabilitic models

What is posterior

A

posterior = (likelihood x prior)/evidence

33
Q

Proabilitic models

P(Y|X)
P(X|Y)
P(Y)
P(X)

A

P(Y|X) = posterior dist.
P(X|Y) = likelihood prob.
P(Y) = prior
P(X) = evidence

34
Q

Logical models

what is meant by Declerative?

A

Declerative: models of this type can be earlisy translated into ruls that are understandable by humans
* Rules can be organised into a feature tree

35
Q

Grading vs Grouping

Difference between grouping models and grading models.

Give 3 points for each

A

Grouping models
* break up the instance space into groups of segments
* grouping models have a fixed and finite resolution
* cannot distingiush between indivual instances beyond this resolution
Grading models
* Do not work based on the notion of segments
* they form one **global model **over the instance space
* infinite resolution

36
Q

Give 1 example of a grouping model

A
  1. Decision trees
37
Q

Give two examples of a grading model

A
  1. Linear regression
  2. Linear classifiers
38
Q

Training vs inference

What is the Training phase

A

Training is the process of creating a machine learning model that has learned ot perform a task using a training set of numerous data points.

39
Q

What is inference phase

A

Inference is the process of using a machine learning model to perform the task on a new data point.

40
Q

what are Features

A

Kind of measurement taht can be easily performed on any instance

41
Q

what if the data is not in the form you want?

A
  1. Feature construction
  2. Discretisation (numerical into categorical)
  3. Feature transformation (project data into a new space)
  4. Feature selection (removing redudntant features)