02 End-to-End ML Project Flashcards
what is correlation
- it measures the strength and direction of the linear relationship between 2 variables.
- it helps to identify the pattern between 2 variables.
- its value ranges from -1 to +1.
what does correlation of -1 mean
it indicates a perfect negative linear relationship: as 1 variable increases, the other decreases.
what does correlation of +1 mean
it indicates a perfect positive linear relationship: as 1 variable increases, the other also increases.
what does correlation 0 mean
there is no linear relationship between the variables (they may still be related in a non-linear way).
data processing required before calculating correlation
removing outliers, because correlation is sensitive to extreme values.
formula of correlation
cov(x, y)/(sdx*sdy)
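The correlation cards above can be tried out with a few lines of Python. This is a minimal sketch using only the standard library; the function name `correlation` and the sample numbers are made up for illustration, and sample (n-1) estimates are assumed for both the covariance and the standard deviations.

```python
from statistics import mean, stdev

def correlation(xs, ys):
    """Pearson correlation: cov(x, y) / (sd_x * sd_y), using sample (n - 1) estimates."""
    x_mean, y_mean = mean(xs), mean(ys)
    cov_xy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys)) / (len(xs) - 1)
    return cov_xy / (stdev(xs) * stdev(ys))

# Illustrative data only
x = [1, 2, 3, 4, 5, 6]
y_rising = [2, 4, 6, 8, 10, 12]   # increases with x
y_falling = [6, 5, 4, 3, 2, 1]    # decreases as x increases

r_rising = correlation(x, y_rising)    # close to +1
r_falling = correlation(x, y_falling)  # close to -1

# A single extreme point appended to the falling data flips the sign,
# which is why outliers are handled before computing correlation.
r_outlier = correlation(x + [30], y_falling + [30])

print(r_rising, r_falling, r_outlier)
```

The last call shows why the outlier card matters: six points with a perfect negative relationship plus one extreme point give a strongly positive correlation.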
what is covariance
it tells us the direction in which 2 variables change together.
+ve covariance
both the variables are moving in the same direction
-ve covariance
as 1 variable increases, the other variable decreases
Formula of covariance
sum((Xi-Xmean)*(Yi-Ymean))/(n-1)
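A small worked example may help with the sign of the covariance. The sketch below multiplies each pair of deviations from the means, sums the products, and divides by n-1; the function name `sample_covariance` and the data are hypothetical.

```python
def sample_covariance(xs, ys):
    """Sum of products of deviations from the means, divided by n - 1."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    products = [(x - x_mean) * (y - y_mean) for x, y in zip(xs, ys)]
    return sum(products) / (n - 1)

# Illustrative data only
temperature = [20, 22, 24, 26, 28]
ice_cream_sales = [100, 120, 140, 160, 180]  # moves with temperature
heating_cost = [50, 40, 30, 20, 10]          # moves against temperature

cov_same = sample_covariance(temperature, ice_cream_sales)   # 100.0 (+ve)
cov_opposite = sample_covariance(temperature, heating_cost)  # -50.0 (-ve)
print(cov_same, cov_opposite)
```

Note that the magnitude depends on the units of the variables, so only the sign is easy to interpret; dividing by the two standard deviations gives the unit-free correlation.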
types of data transformation
- min-max scaler AKA normalization.
- standardization.
what is min-max scaler
- the transformed values lie in the range 0 to 1
- it is highly affected by outliers
formula of min-max scaler
(x-xmin)/(xmax-xmin)
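The min-max formula can be sketched in plain Python as follows. The function name `min_max_scale` and the input lists are assumptions for illustration; the second call shows how one outlier squeezes the remaining values toward 0.

```python
def min_max_scale(values):
    """Rescale values to the range 0 to 1 using (x - xmin) / (xmax - xmin)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        raise ValueError("all values are equal, so the range is zero")
    return [(v - lo) / (hi - lo) for v in values]

scaled = min_max_scale([10, 20, 30, 40, 50])
print(scaled)  # [0.0, 0.25, 0.5, 0.75, 1.0]

# With an outlier, the ordinary values are compressed near 0.
with_outlier = min_max_scale([10, 20, 30, 40, 1000])
print(with_outlier)
```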
what is standardization
- the transformed values have a mean of 0 and a variance of 1.
- each value is transformed as (x-xmean)/sd
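Standardization can be checked the same way. This is a minimal sketch that subtracts the mean and divides by the standard deviation; the function name `standardize` is hypothetical, and the population standard deviation (`pstdev`) is an assumption chosen so that the variance of the result comes out as 1.

```python
from statistics import mean, pstdev, pvariance

def standardize(values):
    """Transform each value to (x - mean) / sd, using the population sd."""
    mu = mean(values)
    sigma = pstdev(values)
    return [(v - mu) / sigma for v in values]

z = standardize([2, 4, 4, 4, 5, 5, 7, 9])  # mean 5, population sd 2
print(z)             # [-1.5, -0.5, -0.5, -0.5, 0.0, 0.0, 1.0, 2.0]
print(mean(z))       # approximately 0
print(pvariance(z))  # approximately 1
```

If the sample standard deviation (n-1) were used instead, the mean would still be 0 but the population variance of the result would be slightly below 1.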
what are the common regression model metrics
- Mean Squared Error (MSE)
- Root Mean Squared Error (RMSE)
- R-Squared (R^2)
- Adjusted R-Squared
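The cards list these metrics without formulas, so the sketch below assumes the standard definitions: MSE is the mean of the squared errors, RMSE is its square root, R^2 is 1 minus the ratio of residual to total sum of squares, and adjusted R^2 penalizes R^2 for the number of features. The function name `regression_metrics`, the parameter `n_features`, and the data are made up for illustration.

```python
import math

def regression_metrics(y_true, y_pred, n_features):
    """Return MSE, RMSE, R^2 and adjusted R^2 for a set of predictions."""
    n = len(y_true)
    y_mean = sum(y_true) / n
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))  # residual sum of squares
    ss_tot = sum((t - y_mean) ** 2 for t in y_true)             # total sum of squares
    mse = ss_res / n
    r2 = 1 - ss_res / ss_tot
    adj_r2 = 1 - (1 - r2) * (n - 1) / (n - n_features - 1)
    return {"mse": mse, "rmse": math.sqrt(mse), "r2": r2, "adj_r2": adj_r2}

# Illustrative data only
y_true = [3, 5, 7, 9, 11]
y_pred = [2, 5, 8, 9, 11]
metrics = regression_metrics(y_true, y_pred, n_features=1)
print(metrics)
```

For this data the squared errors sum to 2 and the total sum of squares is 40, so MSE is 0.4 and R^2 is 0.95; the adjusted value is a little lower because of the penalty for the one feature.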