Week 7 (Actual) Flashcards
What is a Decision Tree?
A tree structure consisting of:
Root/internal nodes: tests on independent variables.
Leaves: values of the dependent variable (the predictions).
Branches: outcomes of each decision/test.
Can be classification or regression.
How are decision trees constructed?
Given a dataset, recursively look for the rule (feature and threshold) that best splits the samples into groups that are internally similar and mutually dissimilar.
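A minimal sketch of one such split search (hypothetical toy data, single numeric feature, impurity scored by Gini):

```python
# Greedy search for the best threshold on one feature, scored by
# the weighted Gini impurity of the two resulting groups.
def gini(labels):
    n = len(labels)
    return 1 - sum((labels.count(c) / n) ** 2 for c in set(labels))

def best_split(xs, ys):
    best = (None, float("inf"))
    for t in sorted(set(xs)):
        left = [y for x, y in zip(xs, ys) if x <= t]
        right = [y for x, y in zip(xs, ys) if x > t]
        if not left or not right:
            continue
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(ys)
        if score < best[1]:
            best = (t, score)
    return best

xs = [1, 2, 3, 10, 11, 12]
ys = [0, 0, 0, 1, 1, 1]
print(best_split(xs, ys))  # (3, 0.0): splitting at 3 separates the classes perfectly
```

A full tree is built by applying this search recursively to each resulting group.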
What is the Gini Index?
Given a training dataset with J classes:
IG(p) = 1 - Σ_{i=1}^{J} p_i^2
where p_i is the fraction of items labelled with class i in the dataset.
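A direct sketch of the formula above on toy label lists:

```python
# Gini impurity: 1 - sum of squared class fractions.
def gini(labels):
    n = len(labels)
    return 1 - sum((labels.count(c) / n) ** 2 for c in set(labels))

print(gini([0, 0, 1, 1]))  # 0.5  (maximally mixed for two classes)
print(gini([0, 0, 0, 0]))  # 0.0  (pure node)
```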
What is Information Gain?
The reduction in uncertainty about the dependent variable Y after splitting the samples on an independent variable X:
IG(Y, X) = H(Y) - H(Y | X)
where H denotes entropy.
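A sketch of the formula on hypothetical data where X perfectly predicts Y, so the gain equals the full entropy of Y:

```python
import math

# Shannon entropy H(Y) of a label list.
def entropy(labels):
    n = len(labels)
    return -sum((labels.count(c) / n) * math.log2(labels.count(c) / n)
                for c in set(labels))

# IG(Y, X) = H(Y) - H(Y | X), with H(Y | X) the weighted
# entropy of Y within each group of equal X values.
def information_gain(ys, xs):
    n = len(ys)
    h_cond = sum(
        (len(sub) / n) * entropy(sub)
        for v in set(xs)
        for sub in [[y for x, y in zip(xs, ys) if x == v]]
    )
    return entropy(ys) - h_cond

ys = [0, 0, 1, 1]
xs = ["a", "a", "b", "b"]  # X perfectly predicts Y
print(information_gain(ys, xs))  # 1.0
```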
What are the drawbacks of Decision Trees?
Unstable: a small change in the training data can produce a very different tree.
Relatively inaccurate: support vector machines and neural networks often perform better.
What are Probabilistic Graphical Models?
Nodes represent random variables; edges (links/arcs) represent direct dependencies, while missing edges encode conditional independence.
Can be undirected or directed (Bayesian).
What are Bayesian Networks?
A kind of probabilistic graphical model that uses the direction of edges to represent cause-effect relationships and Bayes' theorem for probabilistic inference.
A compact representation of a joint probability distribution in terms of conditional distributions.
What are the advantages of Bayesian Networks?
Graphical Representation: of joint probability distributions of random variables - interpretable.
More powerful: can capture complex relationships.
Combine data and prior knowledge: better approximation.
Generative approach: generate new data similar to existing data.
What are the disadvantages of Bayesian Networks?
Requires prior knowledge of many probabilities.
Sometimes computationally intractable.
What are the main problems faced in Bayesian Networks?
Inference.
Training the models.
Determining the structure of the network.
How do you represent the joint probability distributions of random variables?
A set of nodes: represent random variables.
A set of directed edges: represents “directed dependency”.
A conditional distribution for each node given its parents: P(Xi | Parents(Xi)).
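The factorization P(X1, ..., Xn) = Π P(Xi | Parents(Xi)) can be sketched on a hypothetical three-node network Rain → Sprinkler, (Rain, Sprinkler) → GrassWet, with made-up conditional probability tables:

```python
P_rain = {True: 0.2, False: 0.8}
# P(sprinkler | rain)
P_sprinkler = {True: {True: 0.01, False: 0.99},
               False: {True: 0.4, False: 0.6}}
# P(wet = True | sprinkler, rain)
P_wet = {(True, True): 0.99, (True, False): 0.8,
         (False, True): 0.9, (False, False): 0.0}

# Joint probability as the product of each node's conditional
# distribution given its parents.
def joint(rain, sprinkler, wet):
    p = P_rain[rain] * P_sprinkler[rain][sprinkler]
    p_wet_true = P_wet[(sprinkler, rain)]
    return p * (p_wet_true if wet else 1 - p_wet_true)

# The joint probabilities over all assignments sum to 1.
total = sum(joint(r, s, w) for r in (True, False)
            for s in (True, False) for w in (True, False))
print(round(total, 10))  # 1.0
```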
What groups do random variables (nodes) fall into?
Observed: The nodes we have knowledge about.
Unobserved: Nodes we have to infer probability for.
What is the Markov condition?
Each random variable X is conditionally independent of its non-descendants, given its parents.
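A numerical sketch on a hypothetical chain A → B → C with made-up conditional probability tables: the Markov condition implies C is independent of A given B, so P(c | a, b) = P(c | b) for every assignment.

```python
P_a = {0: 0.3, 1: 0.7}
P_b = {0: {0: 0.9, 1: 0.1}, 1: {0: 0.2, 1: 0.8}}    # P(b | a)
P_c = {0: {0: 0.6, 1: 0.4}, 1: {0: 0.25, 1: 0.75}}  # P(c | b)

def joint(a, b, c):
    return P_a[a] * P_b[a][b] * P_c[b][c]

# Verify P(c | a, b) == P(c | b) for all assignments.
for a in (0, 1):
    for b in (0, 1):
        p_ab = sum(joint(a, b, c) for c in (0, 1))
        for c in (0, 1):
            assert abs(joint(a, b, c) / p_ab - P_c[b][c]) < 1e-12
print("Markov condition holds")
```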