Learning from Data Flashcards

1
Q

What are the three main categories of machine learning methods? Provide an example task for each.

A

Supervised: Predicting house prices based on size and location
Unsupervised: Grouping of customers by their spending habits
Reinforcement: Training a self-driving car

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

In supervised learning, what do x’s and y’s represent?

A

X is the input and y is the output.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What function g(z) is used in logistic regression? Why do we use this function?

A

It’s the sigmoid function and is used to predict the output as a probability for classification.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why can’t linear models learn non-linear data distributions?

A

Because linear models are constrained to linear decisions.They can’t comprehend complex relationships like curves.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the key difference between classification and regression in supervised learning?

A

Classification has a finite set of outputs, regression has an infinite set of outputs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Name three problems that are formulated as supervised learning tasks.

A

Image Recognition, Sentiment Analysis, and Imitation Learning/Behavioral Cloning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a decision boundary in the context of classification problems?

A

It’s a line or surface that separates the data into different classes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is 𝛼 in the gradient descent formula?

A

It’s the learning rate.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What makes neural networks capable of handling non-linear relationships?

A

They use non-linear functions, like sigmoid, to learn and approximate complex patterns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the difference between linear regression and logistic regression?

A

Linear regression is used for linear data to get a value from an infinite set of outputs, whereas logistic regression is used for classification, to classify data into finite categories.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the range of output values for the sigmoid function?

A

0, 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Name three examples of unsupervised learning problems.

A

Dimensionality Reduction, Clustering of Data, Topic Modeling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the purpose of the cost function in supervised learning?

A

It measures the error between the predicted values and the actual values. Telling how good the machine is learning with it’s data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What relationship exists between model complexity and the ability to generalize to unseen data?

A

High complexity may overfit the data, easier to train, but will tend to memorize it’s training data.
Low complexity may underfit the data, tougher to train, but more likely to generalize better.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly