Session 4 Flashcards by Leona Rethmann

what is information gain?

it is the most common splitting criterion and is based on entropy
-> it measures how much a split reduces entropy (measures the change in entropy before and after splitting)

what does disorder correspond to?

how mixed the segment is with respect to the values of the attribute of interest

what is entropy?

it is a measure of disorder in the data

how can you calculate entropy?
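
A common definition (assumed here, in line with the "measure of disorder" card): entropy = -sum over classes of p_i * log2(p_i), where p_i is the proportion of examples in the set that belong to class i. A minimal Python sketch; the function name `entropy` and the toy labels are illustrative only:

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    counts = Counter(labels)
    # Add up -p * log2(p) for every class that occurs in the set.
    return sum(-(n / total) * math.log2(n / total) for n in counts.values())

pure = ["yes", "yes", "yes", "yes"]   # only one class: no disorder
mixed = ["yes", "yes", "no", "no"]    # 50/50 mix: maximum disorder for two classes
print(entropy(pure), entropy(mixed))
```

A pure set has entropy 0, and an evenly mixed two-class set has entropy 1 bit.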

what is a parent set?

original set of examples (data points before splitting)

what is a children set?

an attribute (e.g., age) can segment the parent set into k children sets (subsets)

When is an attribute chosen for splitting?

The attribute that reduces entropy the most (= has the highest information gain) is chosen for the split

what is the formula for information gain?
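
The usual textbook form (an assumption here): IG(parent, children) = entropy(parent) - sum over children of (size of child / size of parent) * entropy(child). A rough Python sketch with made-up data; the attribute names "age" and "city" and all labels are hypothetical:

```python
import math
from collections import Counter

def entropy(labels):
    total = len(labels)
    return sum(-(n / total) * math.log2(n / total) for n in Counter(labels).values())

def information_gain(parent, children):
    """Entropy of the parent set minus the size-weighted entropy of the children sets."""
    weighted = sum(len(c) / len(parent) * entropy(c) for c in children)
    return entropy(parent) - weighted

# Hypothetical toy data: class labels before the split and after splitting on two attributes.
parent = ["yes", "yes", "yes", "no", "no", "no"]
split_by_age = [["yes", "yes", "yes"], ["no", "no", "no"]]     # both children are pure
split_by_city = [["yes", "no", "yes"], ["no", "yes", "no"]]    # children stay mixed

gains = {
    "age": information_gain(parent, split_by_age),
    "city": information_gain(parent, split_by_city),
}
best = max(gains, key=gains.get)   # attribute with the highest information gain
print(gains, best)
```

In this toy case "age" removes all disorder, so it has the higher information gain and would be chosen for the split.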

what are disadvantages of ID3 decision trees?

tends to prefer splits that result in a large number of partitions, each being pure but small (we get a very wide decision tree)
overfitting with less generalization capability (it will try to fit every outlier -> it will make a segment for Musk in a ranking of CEO salaries, even if he is the only one that high up)
cannot handle missing values
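
The first point can be illustrated with a small sketch, assuming the standard entropy-based information gain. The rows and the column names `customer_id`, `age_group` and `buys` are invented for the example:

```python
import math
from collections import Counter, defaultdict

def entropy(labels):
    total = len(labels)
    return sum(-(n / total) * math.log2(n / total) for n in Counter(labels).values())

def information_gain(rows, attribute, target):
    """Information gain of splitting `rows` (a list of dicts) on `attribute`."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[attribute]].append(row[target])
    parent = [row[target] for row in rows]
    weighted = sum(len(g) / len(rows) * entropy(g) for g in groups.values())
    return entropy(parent) - weighted

# Hypothetical data: "customer_id" is unique per row, "age_group" has only two values.
rows = [
    {"customer_id": 1, "age_group": "young", "buys": "no"},
    {"customer_id": 2, "age_group": "young", "buys": "no"},
    {"customer_id": 3, "age_group": "young", "buys": "yes"},
    {"customer_id": 4, "age_group": "old", "buys": "yes"},
    {"customer_id": 5, "age_group": "old", "buys": "yes"},
    {"customer_id": 6, "age_group": "old", "buys": "no"},
]

gain_id = information_gain(rows, "customer_id", "buys")
gain_age = information_gain(rows, "age_group", "buys")
print(gain_id, gain_age)
```

Splitting on the unique ID gives six pure one-row partitions and therefore the maximum gain, even though such a split would not generalize to new customers.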

what are the application possibilities of ANN (artificial neural networks)?

spam detection
time series prediction
pattern recognition (e.g., how Van Gogh paints)
computer games

how does an ANN function?

it functions like human neurons -> learning by making interneuron connections

what is a single perceptron algorithm ANN?

uses no hidden layer and mimics biology

how does an ANN work?

inputs go into a propagation function that calculates the net input, then a transformation function calculates an activation level, then we receive an output
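
The flow above can be sketched for a single neuron. The weighted sum, the sigmoid, the 0.5 threshold and all numbers are assumptions chosen for illustration; the card does not name specific functions:

```python
import math

def net_input(inputs, weights, bias=0.0):
    """Propagation function (assumed here: weighted sum of the inputs plus a bias)."""
    return sum(x * w for x, w in zip(inputs, weights)) + bias

def sigmoid(net):
    """Transformation function: maps the net input to an activation level in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-net))

def neuron_output(inputs, weights, bias=0.0, threshold=0.5):
    activation = sigmoid(net_input(inputs, weights, bias))
    # The neuron outputs 1 only if the activation level reaches the threshold.
    return 1 if activation >= threshold else 0

# Hypothetical inputs (e.g. number of amenities, size) with made-up weights.
inputs = [3.0, 1.5]
weights = [0.4, -0.2]
print(net_input(inputs, weights), neuron_output(inputs, weights))
```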

what is the propagation function?

calculates the net input from the inputs (commonly as a weighted sum), where inputs are independent variables, such as the number of amenities

what is the activation function?

function/level that determines whether a neuron produces an output or not (i.e., whether the whole process starts)

how does learning work in ANNs?

by comparing computed (predicted) outputs to desired outputs (true target values) of historical cases
learning is defined as a change of weights between units

what are the three tasks in the process of learning in ANNs?

compute temporary outputs
compare outputs with desired targets
adjust the weights and repeat process
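
The three tasks can be sketched as a training loop for a single perceptron. The update rule (weight += learning rate * error * input), the step threshold at 0, the learning rate and the AND data set are assumptions chosen for illustration:

```python
def predict(weights, bias, x):
    # Task 1: compute a temporary output with the current weights.
    net = sum(w * xi for w, xi in zip(weights, x)) + bias
    return 1 if net >= 0 else 0

def train_perceptron(samples, learning_rate=1.0, epochs=20):
    weights, bias = [0.0, 0.0], 0.0
    for _ in range(epochs):
        for x, target in samples:
            output = predict(weights, bias, x)
            error = target - output   # Task 2: compare the output with the desired target.
            # Task 3: adjust the weights in proportion to the error, then repeat.
            weights = [w + learning_rate * error * xi for w, xi in zip(weights, x)]
            bias += learning_rate * error
    return weights, bias

# Logical AND is linearly separable, so a single perceptron can learn it.
and_samples = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
weights, bias = train_perceptron(and_samples)
predictions = [predict(weights, bias, x) for x, _ in and_samples]
print(weights, bias, predictions)
```

Once every sample is classified correctly the error is 0 for all of them, so the weights stop changing.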

when is a data set linearly separable?

if there exists a straight line (in 2D) or a hyperplane (in higher dimensions) that can perfectly separate all data points of one class from those of another class without any errors

when do we need multilayer perceptron?

what are the three layers in multilayer perceptrons?

input layer: each input corresponds to a single attribute
hidden layers: the middle layers of an ANN that has three or more layers - each additional layer increases the training effort exponentially
output layer: contains the solution to the problem

what does the development process of an ANN look like?

what is the activation function in MLPs?

relation between the internal activation level and the output
can be linear or non-linear
differentiability means that we can take derivatives of the function
there are different types

what are the different types of activation functions?
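
A few commonly used activation functions, as a rough sketch. Which types the course lists is an assumption; the five below are simply standard examples:

```python
import math

def step(net, threshold=0.0):
    """Binary step: 1 once the net input reaches the threshold (not differentiable there)."""
    return 1.0 if net >= threshold else 0.0

def linear(net):
    """Linear (identity): the output equals the net input."""
    return net

def sigmoid(net):
    """Logistic sigmoid: non-linear, differentiable, output in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-net))

def tanh(net):
    """Hyperbolic tangent: non-linear, differentiable, output in (-1, 1)."""
    return math.tanh(net)

def relu(net):
    """Rectified linear unit: 0 for negative net input, linear otherwise."""
    return max(0.0, net)

for name, fn in [("step", step), ("linear", linear), ("sigmoid", sigmoid),
                 ("tanh", tanh), ("relu", relu)]:
    print(name, [round(fn(x), 3) for x in (-2.0, 0.0, 2.0)])
```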

what are the four types of learning?

supervised learning
unsupervised learning
reinforcement learning (you don't give the correct output, you only say whether the output is correct or incorrect)
direct design methods

what are the two types of training?

1. incremental training (you adapt the model step by step, updating it as each new piece of data is added) 2. batch training (you train the model on a batch of data at a time, updating the weights only after the whole batch has been processed)

what are the 5 learning rules in ANN?

1. delta rule 2. gradient descent 3. back propagation 4. Hebbian rule 5. competitive learning

what is back propagation?

- similar to the delta rule, but also calculates weight changes for hidden layers

what is gradient descent?

- finding the combination of all weights for which the sum of the squared errors F is minimized - but it has high computational complexity in high-dimensional spaces
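
A minimal sketch of the idea for a single linear unit with one weight `w` and a bias `b`, minimizing F = sum of squared errors. The data, learning rate and step count are made up for illustration:

```python
def sum_squared_errors(w, b, data):
    """F: sum of squared differences between predicted and desired outputs."""
    return sum((w * x + b - y) ** 2 for x, y in data)

def gradient_descent(data, learning_rate=0.01, steps=2000):
    w, b = 0.0, 0.0
    for _ in range(steps):
        # Partial derivatives of F with respect to w and b.
        grad_w = sum(2 * (w * x + b - y) * x for x, y in data)
        grad_b = sum(2 * (w * x + b - y) for x, y in data)
        # Move both parameters a small step against the gradient.
        w -= learning_rate * grad_w
        b -= learning_rate * grad_b
    return w, b

# Hypothetical data generated from y = 2x + 1.
data = [(0.0, 1.0), (1.0, 3.0), (2.0, 5.0), (3.0, 7.0)]
w, b = gradient_descent(data)
print(round(w, 3), round(b, 3), sum_squared_errors(w, b, data))
```

With only two parameters this is cheap. In a real network every weight gets its own partial derivative, which is where the computational cost in high-dimensional spaces comes from.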

Session 4 Flashcards

(29 cards)