2.4—a statistical primer Flashcards
2.4 Learning Objectives
- know the key terminology of statistics.
- understand how and why psychologists use significance tests.
- significance tests are statistics that tell us whether differences between groups or distributions are meaningful.
- how much variability there is among individuals within each of the groups will determine whether the averages are significantly different.
- in some cases, the averages of the two groups may be different, yet not statistically different because the groups overlap so much.
- apply your knowledge to interpret the most frequently used types of graphs.
- analyze the choice of central tendency statistics based on the shape of the distribution.
2.4 Focus
- how do psychologist use statistics to describe their observations?
- how are statistics useful in testing the results of experiments?
Statistics
- statistics can be boiled down to two general steps:
- organize the numbers so that we can get a “big picture” view of the results.
- this process is helped by the creation of tables or graphs.
- test to see if any differences between groups or between experimental conditions are meaningful.
- organize the numbers so that we can get a “big picture” view of the results.
- once these steps have been completed, it’s possible to determine whether the data supported or refuted the hypothesis.
Descriptive Statistics
- descriptive statistics: a set of techniques used to organize, summarize, and interpret data.
- this gives you the “big picture” of the results.
- the statistics used to describe and understand the data are of three types: frequency, central tendency, and variability.
Distribution
made up of two pieces of information:
- whether some numbers occurred more often than others,
- and whether all of the numbers were clumped in the middle or more evenly spaced across the whole range.
Normal Distribution
(sometimes called the bell curve) a symmetrical distribution with values clustered around a central, mean value.
Negatively Skewed Distribution
a distribution in which the curve has an extended tail to the left of the cluster.
Positively Skewed Distribution
a distribution in which the long tail is on the right of the cluster.
Central Tendency
a measure of the central point of a distribution.
Mean
the arithmetic average of a set of numbers.
Median
the 50th percentile—the point on the horizontal axis at which 50% of all observations are lower, and 50% of all observations are higher.
Mode
the category with the highest frequency (i.e. the most observations).
Variability
- variability: the degree to which scores are dispersed in a distribution.
- high variability means that there are a larger number of cases that are closer to the extreme ends of the continuum for that set of data.
- e.g. a lot of excellent students and a lot of poor students in a class.
- low variability means that most of the scores are similar.
- e.g. a class filled with B-students.
- variability can be caused by:
- measurement errors.
- imperfect measurement tools.
- differences between participants in the study.
- and/or characteristics of participants on that given day (e.g. mood, fatigue levels).
- if information about variability is not provided by the researcher, it is impossible to understand how well the measure of central tendency reflects the entire data set.
Standard Deviation
a measure of variability around the mean.
Hypothesis Test
- hypothesis test: a statistical method of evaluating whether differences among groups are meaningful, or could have been arrived at by chance alone.
- the difference in the central tendency for the two groups represents a “signal” that we are trying to detect.
- the variability represents the “noise,” the outside forces that are making it difficult to detect the signal.