All Flashcards

1
Q

How do you define machine learning? (Formal)

A

A computer program that is said to learn from experience E, with respect to some class of tasks T and performance measure P, if its performance in tasks in T, as measured by P, improves with experience E.

ETP, the experience of the task of ET’s performance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How do you define Machine Learning (Informal)

A

A field of study that gives computers ability to learn without being explicitly programmed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

ETP mean?

A

Experience, Task, Performance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does E mean in ETP?

A

The experience of a task that the machine learns from.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does T in ETP mean?

A

The task the machine is doing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What does P in ETP mean?

A

The performance measured by the task T.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is an example of ETP?

A

Learning how to use a radio, playing chess or checkers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the two common types of machine learning?

A

Supervised Learning and Unsupervised Learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does supervised learning mean?

A

Supervised Learning is where the machine gets the correct answers and learns from it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does unsupervised learning mean?

A

Unsupervised learning is where the machine makes structure by clustering the data based on the variables in the data without any knowledge what the results look like.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Is there feedback based on the prediction of the results in unsupervised learning?

A

No.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is clustering?

A

Grouping things onto groups based on different variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is non-clustering?

A

Finding a structure in a chaotic environment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does the notation y_i mean?

A

Output or target features

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does the notation x_i mean?

A

Input features

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does the notation (x_i, y_i) mean?

A

The Training example

17
Q

What does the notation m mean?

A

The Training set

18
Q

What does the notation Y mean?

A

The space of the output values.

19
Q

What does h mean?

A

The hypothesis function.

20
Q

What is a cost function?

A

Used to measure the accuracy of the hypothesis function. It also takes the average difference of all the results of the hypothesis.

21
Q

What is the idea of the gradient descent?

A

To minimize the thetas so that the hypothesis function’s results can be closer to y for a training example.

22
Q

What is the formula of a mean squared error cost function?

A

.

23
Q

What is the gradient descent?

A

An algorithm used to simultaneously update the thetas, on which is used to improve the hypothesis.

24
Q

What is the algorithm of gradient descent?

A

T means Theta in this example.
a means alpha

repeat until convergance{
T_j:=T_j-a*d(cost)/dT_j (for j=0 and j=1)
}

25
Q

What is alpha?

A

The alpha is the learning rate of the for gradient descent.

26
Q

What happens when the alpha is too high?

A

The gradient descent can overshoot the minimum and may fail to converge, or even diverge.

27
Q

What happens if the alpha is too low?

A

The gradient descent will be incredibly slow.

28
Q

How do you correctly simultaneously update the thetas?

A

Assign the new values of the thetas into temporary variables first then re-assign the back using those temporary variables in the end.

29
Q

What are derivatives?

A

A description of how a function changes with small or multiple variables.

30
Q

What is the idea of the hypothesis function?

A

To give the predicted output from the given input feature.

31
Q

Cost must go down?

A

Yes

32
Q

What is a batch?

A

Each step of gradient decent uses all the training examples

33
Q

Are there different types of gradient descent?

A

Yes

34
Q

Can gradient descent converge if alpha is fixed? (As long as alpha is not too high or low)

A

Yes

35
Q

Do you have to decrease alpha overtime?

A

No.

36
Q

Can gradient descent find the global minimum of any function?

A

No.

37
Q

Can any linear regression function find the local optima?

A

No, it can only find the global optima.

38
Q

What does optima mean?

A

The relative best solutions.

39
Q

What is regression?

A

Where the machine predicts a continuous output, Like predicting a house price based on multiple variables.