Chapter 5: Identifying good measurement Flashcards
All statistical analysis can be divided into two branches or categories:
Descriptive or inferential
Descriptive
Literally just describing the data in your sample,
with no attempted conclusions about the
entire population. Means, medians, standard
deviations, etc., and some other stuff like effect sizes.
Inferential
Using your data as the basis to make inferences about
how things work in the overall population (i.e.,
beyond just your data). Assessing statistical
significance.
The overall goal of descriptive statistics is
to summarize data in a way that is efficient and accurate.
Measures of central tendency tends to
One aspect of that is providing single a value that represents the many numbers that make up a data set
The most common measures of central tendency are
mean, median, mode
variability
How dispersed or spread out the values in a data set are. Examples of such measures include standard deviation, variance, and range.
To do a good job describing data
you need to provide information on central tendency AND variability. Neither one alone provides enough information.
Bar graph
Top of bar=central tendency, error=measure of variability
Z scores
(x-μ)/σ
x= Individual’s score
μ=Mean score from sample
σ=Standard deviation of the sample’s scores.
Result of standardizing a score (value) with reference to the rest of the scores in the sample (a course section, in this case). Thus, it represents a score’s position relative to the mean in units of standard deviation. Where the score falls relative to the mean.
The correlation coefficient (r-value) is usually considered
a descriptive statistic, not surprising given its description of an association.
This inference relates to whether the conclusions drawn from the sample data can be applied to the population. But how?
A statistical test e.g., a t-test
A test statistic e.g., t
A p-value
Is it statistically significant?
null hypothesis testing
It relates explicitly to inferential statistics and is a bit different from the theory-data cycle’s concept of a hypothesis.
Null hypothesis
is always that the result is NOT statistically significant,
and statistical significance tells the researcher to reject the null hypothesis in favour of the alternative hypothesis.
Rejecting the null hypothesis
The result is statistically significant
Retaining the null hypothesis
A result of this magnitude is not statistically significant+
As mentioned before, the threshold for significance (or alpha, α) is usually
p < 0.05, meaning that the likelihood that the observed results are due to random chance and not “real” (applicable to the whole population) is < 5%.
t-value is to t-test as
r-value is to correlation
t-test
common whenever you are comparing the mean and variability of two groups. The result is again a p-value
an analysis of variance (ANOVA)
a statistical test that works like a t-test but compares more than two groups. After they show that overall difference, they compare between each pair of groups (usually called pairwise or post-hoc comparisons or tests).
Alternative hypothesis
The result is statistically significant.
Overall, there are tons of different statistical tests that can be use depending on the situation (and that you don’t have to know), but they all lead to
a p-value that can be used to assess statistical significance