Test 2 Flashcards
Assess the strength of linear association between variables
Correlation
One is not dependent on the other
equation that best describes linear relationship between variables. Manipulate X to see what Y does
one variable is assumed to vary linearly depending on the level of the other
Linear regression
can be (-1 - 1). the closer to 1 or -1 = a strong linear association. its the magnitude and direction
Correlation Coefficiant
How two variables move together
(-1 to 1) positive or negative
Corrleation
How two variables vary from their means
Also how two variables move together
Covariance
- assumes that each variable follows a normal distribution
- Also follow a bivariate normal distribution ( its a moultivariate distribution)
- both variables are continuous
Pearson’s Correlation Coefficient
- Does not follow a normal distribution
- Alternative Nonparametric measure of correlation
- Assess linear and non-linear association(curves)
usually works off of ranks
Spearman’s
Y is always ______ on x
Dependent
x - independent
y - dependent
The equation of the line is completely dependent on what
The slope and in the y-intercept
What determines the values for the estimates of the slope and the y-intercept? It determines the best fit line. The sum of all the distances is what we are trying to minimize
Least Squares
what measures the expected rate of change in the dependent variable for one unit increase in the independent variable?
The SLOPE
situation in which the variance of the dependent variable is the same for all the data. The variability of BMI is the same for a person that is 5’10’’ and for a person that is 5’5’’.
Homoscedasticity
What does the null say with correlation coefficients?
There is no linear association N=0
The relationship between two variables
One is always dependent on the other
Simple linear regression
The measure of the rate of change in response variable for a unit increase in the independent variable
Slope in linear regression
What helps us estimate the mean value of y at a given value of X?
Population regression line - this is what we want to know so we can estimate it for the entire population
what method uses the equation of the line - y=a+bx - to describe the relationship between two variables
Regression
linear regression
the sampling variability - how the slope will vary from sample to sample is _____
standard error
what two values are used to calculate the std error of the slope
MSE and sum of squares for X
Which of the following statements best describes the slope parameter in a linear regression equation
measures the rate of change in the response variable for a unit increase in the independent variable
The relationship between two variables
One is always dependent on the other
Simple linear regression
Which of the following is used to test for the association between two categorical variables?
a single sample from same population and measure hair and eye color (2 variables) and test how these are associated
– Chi square
What helps us estimate the mean value of y at a given value of X?
Population regression line - this is what we want to know so we can estimate it for the entire population
Which of the following test is most appropriate for categorical data obtained from paired samples?
Before and after characteristics!! - is the proportion different on two occasions?
McNemar’s test
What test is used to measure one variable in two different populations?
depression in Males and Females
Homogeneity
what two values are used to calculate the std error of the slope
MSE and sum of squares for X
Which of the following statements best describes the slope parameter in a linear regression equation
measures the rate of change in the response variable for a unit increase in the independent variable
what is the strength of association; how reliable is the the association?
Degree of association