Machine Learning Flashcards

1
Q

Define Machine Learning.

A

A computer is said to learn from experience E with respect to a class of tasks T and performance measure P if its performance at tasks in T, as measured by P, improves with experience E.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What’s the difference between top down and bottom up?

A

Bottom up: give the system lots of data to discover for itself
Top down: model all different functions and wire these together, and write manual instructions for solving a task

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Is machine learning top down or bottom up?

A

Bottom - up

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What’s the process of using machine learning to reach AI?

A
  1. analyse the task manually
  2. select appropriate model/s
  3. Training and testing
  4. Use the same model to solve unseen tasks
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the 3 pillars of machine learning?

A

1.Models and algorithms
2.Computation (powerful and cheaper)
3. Massive data warehouse

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Explain the model and algorithm pillar of machine learning.

A

Doesn’t require knowledge engineers to find all functions and ‘wire’ them in

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Explain the data pillar of machine learning.

A

We have massive amounts of data and the more we can feed into machine learning the more successful it will be.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is data mining?

A

Exploration and analysis of large quantities of data to discover valid, novel and useful patterns that are understandable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a big difference between data mining and machine learning?

A

Data mining gives understandable patterns so it can be explained why a conclusion has been come to as opposed to through machine learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the two types of learning in machine learning?

A

Supervised and Unsupervised Learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Top-down and bottom-up are also known as?

A

Deduction (top-down)
Induction (bottom-up)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Describe supervised learning.

A

Learns from labelled data which consists of an input and a desired output

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Describe Unsupervised learning.

A

Learns from unlabelled data to describe a hidden pattern/structure
e.g. to reach a model or regression or grouping

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Name three types of supervised learning.

A

Linear regression, decision tree, ANN

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is reinforcement learning? What is the goal?

A

Learning over time based on trial and error / awards from actions. Goal is to learn a process to maximise reward

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are some examples of reinforcement learning?

A

Deep neural networks, Q-learning, deep Q networks

17
Q

What are the two main differences between supervised and unsupervised learning?

A
  • supervised has a relationship between Y and f(x), unsupervised usually doesn’t.
  • supervised uses a labelled data set, unsupervised uses an unlabelled set
18
Q

In data cleaning: what is the meaning of imputation?

A

Replacing missing values with realistic estimated values

19
Q

What is clustering?

A

The process of grouping unlabelled data based on a similarity or distance metric

20
Q

What’s the point of a test data set?

A

To check how well a trained data set performs

21
Q

What is regression training?

A

Finding a function based on given data that minimises its mean squared error to ‘fit’ the samples

22
Q

What is overfitting?

A

Fitting a model tightly to a small data set which increases the risk of fitting to an anomaly or noise data

23
Q

What are 3 pros of linear regressions models?

A
  • short training time
  • easy to interpret
  • easy to implement
24
Q

What are two cons of linear regression models?

A
  • sensitive to noise and outliers
  • cannot handle complicated relationships
25
Q

What is classification learning?

A

Learning to predict which set an instance belongs to based on pre-labelled instances

26
Q

In the decision tree model, what is an internal node?

A

A decision point where a test is made on an attribute of the data and a decision is made

27
Q

In the decision tree model, what is a branch?

A

The course of a decision or action

28
Q

In the decision tree model, what is a leaf node?

A

A predicted class label / output reached through a path of decision points (internal nodes) and branches

29
Q

What are 4 pros of decision tree?

A
  • Reasonable training time
  • Can handle large number of features
  • Easy to implement
  • Easy to interpret
30
Q

What are 4 cons of decision tree?

A
  • Only simple decision boundaries
  • Problems with lots of missing data
  • Cannot handle complicated relationships
  • Over- complex tree: potential for overfitting
31
Q

When we only have one dataset, why do we need to split it?

A

To have data for training and data for testing

32
Q

What stage happens before the training set is fed into the machine learning method?

A

Data pre-processing

33
Q

What is a confusion matrix?

A

It represents how much data is predicted to each outcome and how many of those are predicted correctly and incorrectly

34
Q

What is the concept of perceptron in Neural networks?

A

There is a set of weighted inputs that if they surpass a certain amount, it causes a set of outputs.

35
Q

What is the differing meaning of positive and negative weights for neural network inputs?

A
  • a positive weight encourages the neuron to fire as a result of that input
  • a negative weight discourages the neuron from firing as a result of that input
36
Q

What are the 4 logic functions?

A
  • Or
  • And Not
37
Q

What does the logical OR function do in ANNs?

A

one or other of the inputs needs to be one

38
Q

What is the logical function ‘And Not’ in ANN?

A

A AND NOT B
means A must have a value of 1 and B a value of 0