Exam 2: Summary Statistics Flashcards

1
Q

Levels of Measurement

A

Nominal, Ordinal, Interval, Ratio

* Need to know in order to best present and analyze your data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Nominal

A
  • Ideally exhaustive and mutually exclusive categories
  • Was the sentence reportedly correct or incorrectly (or no response)?
  • 2 categories: Binomial aka dichotomous
  • More than 2: Multinomial
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Ordinal

A

ordered categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Interval

A

• Equal distances between scores
• Calculate differences but not proportions
(can be discrete or continuous)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Ratio

A

• Interval + a true zero
(can be discrete or continuous)
• Percent correct
• e.g., 50% is twice as accurate as 25% (note that this measure is bounded)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Levels of measurement applied to hearing measurement

A

Nominal: hearing loss or normal hearing
Ordinal: degrees, mild-profound

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Descriptive Statistics

A

• Frequencies, Percentages, Proportions

• How often a phenomenon occurred
• Primary summary measure for nominal measures
• But can be used for other data types
• Across participants or behaviors
Mean, Median, and Mode
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

• Measures of Central Tendency

A
  • Mean (Interval, Ratio “M”)
  • Median (Ordinal, Interval, Ratio, “Mdn”)
  • Mode (All types)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Measures of Variability

A
  • Min, Max
  • Range = Max – Min
  • Interquartile range = score at 75th percentile – score at 25th percentile; Relevant if your data have extreme highs or lows

• Standard Deviation: Dispersion of scores around the mean; Colloquially: On average, how much do observations differ from mean?
SD = sqrt(variance)

• Standard Error of the Mean: How far is the sample mean likely to be from the population mean? • SE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Tables

A

• Rule of thumb: Data that requires ≤2 columns or rows for
a table should just be presented in the text
• Arrange/group information logically
Use APA format

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Pie chart

A

use when a number adds up to 100%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Shapes of Distributions

A
  • Important for knowing:
  • The best way to summarize your data
  • The type of statistical test you should perform
  • Some tests assume a particular distribution (e.g., “normally distributed”)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Normal Distribution

A

• “Gaussian curve”
• Largest number of observations at the center
• Symmetric (as many below as above)
• Fewer as you get towards extreme values; 2/3 of observations will fall within 1 SD of the mean
Value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Skewed Distribution

A

• Not symmetric
• More extreme scores in one direction
• Mean most affected by skew
• More skewed, the more the difference between mean and median
negative skew: more extreme scores on the right

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Other Distributions

A
  • Distributions can have:
  • Different centers
  • Different variabilities
  • Different kurtosis (peakedness and shape of tails)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Standardized Scores

A

• Accounts for both average and variability of the score
• z-score = (score – M) / SD
~Resulting Mz-score = 0 and SDz-score = 1
~How many SD above or below the mean is a given score?

• Straightforward way to relate a value to a normal distribution and to other z-scores

z scores have directional information

17
Q

t distribution

A
• A t-distribution is a z-distribution that is shifted and scaled such that:
• M = 50
• SD = 10
• Typically used:
• Small sample size
• Unknown population
standard deviation
18
Q

Outliers

A
  • Extremely deviant values
  • Not necessarily inaccurate!

• How to identify?

  • Review your experiment notes
  • Plot your raw data

-Set a priori criteria based on your measure and literature-
• Trimming of reaction times is common
• e.g., more than 2.5 SD slower than the mean, and/or <200 ms
• Box and whisker plots

How to deal with them?
• NEVER EVER remove data without minimally describing:
-how and why you did
-what the impact was on the results
-how much data you’ve removed and was removal equally distributed across conditions

• Problems can arise from interpreting data with real outliers or without “outliers”