Task 4 Consensus Flashcards
Associated
describes the relationship between two variables
→variables are associated if knowing the values of one variable tells you something about the values of the other variable that you would not know without this variable
Response variable
measures an outcome of a study (dependent variable) y
Explanatory variable
explains or causes changes in the response variable (independent values) x
Scatterplots
shows the relationship between two quantitative variables measured on the same individuals. Values of one variable on the horizontal axis the other on the vertical axis.
→explanatory variable on the horizontal axis
Examining a scatterplot
→look for the overall pattern and for striking deviations from the pattern
→overall pattern can be described by form, direction, and strength
→deviation mostly important are outliers
linear (scatterplot)
• When the data lie in a roughly straight line in a scatterplot
positive association
when above average variables tend to accompany the above average values of the other variable. Same with the below-average values (when the graph goes from the lower left corner to the upper right corner
negative association
when above and below average values of different variables occur together (when the graph goes from the upper left corner to the lower right corner)
Correlation
When r is positive it is positive association and visa versa
When r goes near –1 or +1 one it is a strong relationship
The two way table
is used when both variables are categorical, that gives counts for each combination of values of the two categorical variables
Row variable (two way table)
(met requirement) because each horizontal row in the table describes whether or not was met.
Column variable (two way table)
age is the column variable because each vertical column describes one age group
Joint distribution (two way table)
dividing the cell entry by the total sample size, the collection of these proportions is the joint distribution
Marginal distribution (two way table)
Divide one variable with the total amount of both variables
Conditional distributions
When we condition on the value of one variable and calculate the distribution of the other variable, we obtain a conditional distribution (how many have met the requirement and how many not