02 End-End ML Project Flashcards

1
Q

what is correlation

A
  1. it measures the degree of association or relation between 2 or more variables.
  2. it helps to determine the pattern between 2 variables.
    3.value ranges from -1 to +1.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what does correlation of -1 mean

A

it indicates that if 1 variable increases another variable decreases and the relation is strong.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what does correlation of +1 mean

A

it indicates that if 1 variables increases other variable also increases and the relation is strong.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what does correlation 0 mean

A

the relation between both the variables is low.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

data processing required before calculating correlation

A

removing of outliers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

formula of correlation

A

cov(x, y)/sdx*sdy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what is covariance

A

it tells us the direction in which both the variables chenge.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

+ve covariance

A

both the variables are moving in the same direction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

-ve covariance

A

if 1 variable is increasing other variable is decreasing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Formula of covariance

A

sum(Xi-Xmean)-(Yi-Ymean)/(n-1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

types of data transformation

A
  1. min-max scaler AKA Standardisation
  2. normalization.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is min-max scaler

A
  1. the transformed range between 0 - 1
  2. it is highly affected by outliers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

formula of min-max scaler

A

(x-xmin)/(xmax-xmin)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is standardization

A
  1. the mean of the transformed value is 0 and the variance will be 1.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are all Regression Model Metrics

A
  1. Mean Sq Error (MSE)
  2. Root Mean Sq Error (RMSE)
  3. R-Square (R^2)
  4. Adjusted R-Square
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is MSE?

A

Mean of the difference between actual and predicted value.

17
Q

what is RMSE?

A

it the square root of the mean of square of difference between actual and predicted.
if RMSE is 90 than the gap between actual and predicted is 90.

18
Q

What is R^2?

A

with RMSE it is difficult to understand which model is better for different problems hence we use R^2.
formula = 1-(RSS/TSS) where RSS is Residual Sum of Sq and TSS is Total Sum of Sq.

19
Q

what is adjusted R^2?

A

R^2 does not change when new variables are added and hence we use
adjusted R^2 to understand the performance of model when new features are added.