Midterm 2 - Chapter 12 Flashcards
When do we need stats in the research process?
After collecting data - need to summarize & communicate findings
2 types of stats:
- Descriptive Statistics
- Inferential Statistics
How should we most efficiently present research:
Want to convey maximum information using minimum space
Purpose of descriptive statistics?
Summarizes mass of data points
- Understanding and interpretation
- Visual displays, appropriate calculations
In experiments can calculate within each
- condition/group
- Mean, standard deviation…
In correlation designs
- For each variable, calculate mean, standard deviations, etc
- For every pair of variables, calculate a correlation coefficient (also a descriptive statistic)
3 Types of Descriptive Statistics
- Measures of Central Tendency (Mean, Median, Mode)
- Measures of Variability (Range, Variance, Standard Deviation)
- Measures of Relationship (Correlation, Multiple Regression, Multiple Correlation, Partial Correlation)
Scales of Measures
- Nominal
- Ordinal
- Interval
- Ratio
Nominal
Group or categorization
- No order or direction
- Summarized by proportion/percentages or the mode
Ordinal
Ranked order (1st, 2nd, 3rd..)
- Uneven spaces between “scores”
Interval
Numerical scales in which intervals have the same interpretation throughout but no true zero (e.g. temp in celsius - 0 deg still indicates a temperature)
Ratio
An interval scale with a true zero reference point (e.g. 0 pounds)
- Summarized with the mean or median and standard deviation
Measures of central tendency
- Describe what’s happening at middle of data
- What’s “normal”?
3 measures of central tendency:
Mean, Median, Mode
Mean
= arithmetic average
- What we usually use!
- Uses information from every single score
- Add up all scores in each group and divide by the
number of scores in each group
Downsides of Mean:
□ Affected by outliers (i.e., extreme scores)
Upsides of mean:
□ With increasing sample size, each extreme score has less effect on the mean.
□ Maximizes use of all of our data.
□ Has mathematical properties that enable us to
use it in statistical analysis.
Outliers - With increasing sample size, the mean is
Less affected by outliers
□ Main idea here - check for outliers if you only have a small sample, but try to get a large sample
Median
= score that divides group in half
- 50% of the scores above, 50% below
- Used if there are extreme scores (outliers)
How to find median:
Put scores in order. Count number of scores.
If odd #: identify the middlemost score.
If even#: identify two middle scores, take average of them.
When is median useful?
Whenever it’s most descriptively informative to report the value for which equal numbers of people score higher
and lower (e.g. income)
- Also, when you can spot an outlier
Mode
= most frequently occurring score
- Sometimes no mode; sometimes more than one
- Usually used for nominal or ordinal variables
- Put the scores in order – look for most frequently occurring score(s)
- May be none, or more than one
When is mode useful?
Whenever it’s most descriptively informative to
report the most frequently occurring score (e.g.: employee salary distribution)
Measures of Spread:
- Variability
- Standard Deviation
Variability:
The spread in a distribution of scores
AMOUNT of spread is often measured by Standard Deviation
How much each score deviates from the mean
Possible Measures of Variability
□ Range (max – min)
□ Variance
□ Standard Deviation
Issues with range
can be too simplistic
Issues with variability:
not very descriptive
Standard Deviation
□ A measure of variability that enables reference to the Normal Distribution so it’s meaningful (as opposed to variance).
□ Defines what’s “normal” for that variable
How to calculate variance:
- Find out how much each score deviates from the mean (mean of 5, 0 = 5)
- Square each number
- Add up these numbers
- Divide by TOTAL number of scores being calculated MINUS 1 (not the total of the numbers, but how many there are - 7, 3: not 10, but 1)
One half of the bell curve, SD of +/-1, SD of +/- 2, SD of +/- 3
- 50%
- 68%
- 95%
- 99.7%
How to calculate Standard Deviation:
SQUARE ROOT of variance
Measures of Relationship:
3 types of Descriptive Stats
- Correlation
- Multiple Regression
- Multiple Correlation
- Partial Correlation
Correlation (r) and r-squared
□ +/- = “direction” of relationship - Positive or negative?
□ Number = “strength” of relationship?
(How closely is one set associated with other set?)
For linear relationships!!
- r = 0 could mean no relationship OR a non-linear relationship!
Correlations - Restriction of range
Correlations can be misleading if the full range on both variables is not measured