Statistical Methods For Business Analysis Flashcards
What type of random variable are there?
Discrete (whole numbers)
Continuous (values with in a range)
What is the variance?
How much the values differ from the expected value or average.
What is standard deviation?
The typical distance between each data point and the mean or expected value.
What are the requirements of the standard normal distribution?
Expected value = 0
Standar deviation = 1
What means if the null hypothesis is true?
There is no relationship
What means if the null hypothesis is false?
There is a relationship
What is the T statistic
It’s a parameter that measures the distance between the average x and the value of the null hypothesis. How far is x from de expected value.
What we use the P value for?
To say how confident we are and decide if reject the null hypothesis or not.
What does a P value of 0 or 1 mean?
If 0 there is a relationship, the null hypothesis is false.
If 1 there is nothing going on, the null hypothesis is true.
What is the unsupervised statistical learning?
We have a lot of variables and we want to learn something about them without a guide.
What is supervised statistical learning?
We have a target variable (y) and our objective is to learn about the relationship between x and y.
What is the regression about?
Learning about the relationship between Y and one or more Xs
What does it mean if the relationship between X and Y is deterministic.
Y is completely dependent on X, no allow to error.
What does it mean if the relationship between X and Y is probabilistic?
There is allowance for random error and unexplained variation. Error is independent from x.
What is correlation?
Describes the strength and direction of the relationship between variables. How much one changes when the other one changes. Could be from -1 to 1
-1 means perfect negative linear relationship
1 means perfect linear relationship
0 means no linear relationship at all.
What does linear regression mean?
That the relationship between the variables is linear. The change in x results in a PROPORTIONAL change in y.
What is MSE?
The mean square error, mesurares the quality of prediction of our model in regression. We want it to be low.
What are the residuals?
The difference between the predicted response variable and the actual response
variable .
They need to be around 0 without being equal between them.
What does the LS. Least square criterion?
Minimizes the Residual Sum Squares, trying to make the residuals as smalls as possible
What is covariance?
Mesures to what extent two variables change together
What is RSE?
The residuals standard error, the variance of the residuals.
Should be small.
What is R^2?
The coefficient of determination. Tells us how accurately is the model making predictions by measuring the proportion of variance explained by x. Ranges from 0 to 1. 0 bad, 1 good.
In multiple linear regression how can we tell if there is a relationship or not.
If lo the parameters (B) = 0 there are no relationships. If AT LEAST one is not equal to 0 there is a relationship.