Relationships Between Variables Flashcards
What is a better way to show data in tables and why?
In percentages because you can see clearly the relationship between variables.
How do you use tables with two continuous variables?
Break data down into bands so that information can be shown more clearly and compact.
How do you use tables with one continuous and one nominal variable?
A simple can be used, but the mean and other descriptive statistics for the continuous variable are normally given.
Using diagrams: Two nominal variables
You can use either a compound or a stacked bar chart. May need to collapse some categories.
What is a bad stacked bar chart?
One with too many bars, doesn’t clearly show the relationships. Looks too busy.
Using diagrams: Two continuous variables
Use a scatter plot and then add a line of best fit to help visualise the relationship between the two.
What can a line of best fit show?
It shows whether a relationship is positive/negative. If it is steep then there is a strong relationship and the variables have a great affect on each other and if it is flat then they don’t.
What direction is a positive line of best fit?
Bottom left to top right.
What direction is a negative line of best fit?
Top left to bottom right
How do we quantitatively estimate how the two variables covary?
By using the covariance formula
What is the covariance?
It is like variance but with two variables.
How do you use tables with two nominal variables?
A cross-tabulation is usually appropriate.
May need to collapse categories if there are too many (e.g. gender and smoker/non-smoker)
How does covariance work?
Like in variance where we find how much one data point deviates from the mean and then sum all the deviations and divide them by the number of points, we do this but with two variables which we times together then sum all the deviations before dividing by the number of data points.
If the deviations have the same sign, what is the product?
Always positive
If the deviations are of opposite sign, what is the product?
Product is negative