Basic Stats & Hypothesis Testing Flashcards
Inferential Stats
Used to interpret or draw inferences about a set of observations (e.g. t-test, ANOVA)
Making inferences about a population from a sample of observations
Descriptive Stats
Used to summarize or describe a set of observations (e.g. mean, variance)
Mean
Average score calculated by the sum of the scores divided by the total number of scores
Sensitive to exact value of all scores in distribution
Median
Midmost score in the series of n scores; must arrange them hierarchically before finding this
Less sensitive to outliers
Mode
Score that occurs with greatest frequency
Can have more than one mode
Measures of Central Tendency
Mode
Median
Mean
Measures of Variability
Range
Variance
Standard Deviation
Deviation Score
Range
Distance between the highest and lowest score
Measure of dispersion
Only gives info about the 2 extreme scores
Variance
Average of [how much each score deviates from the mean]^2
Standard Deviation
Square root of the variance
Deviation Score
Tells us how far away the raw score is from the mean of the distribution
Calculate the mean, subtract each score from the mean
Sum of deviations scores = 0, so we square each deviation score before summing
Shapes of Distributions
Rectangular distribution
Bimodal
Normal (bell curve)
Positively/negatively skewed
Statistic
Computed on a sample
Sample mean (X) is an estimate of the population mean (𝜇)
Sample variance (S2) is an estimate of the population variances (𝛔2)
The sample standard deviation (S) is an estimate of the population standard distribution (𝛔)
Parameter
Computed on a population
Random Sampling (selection)
Choose random subset of the population ensuring that each member of the population has an equivalent chance of being sampled
Examine that sample and use your observations to draw inferences about the population
If a sample is representative is has external validity - important for generalizing the results to the population
Random Assignment
Whereas random sampling refers to the source of the data (and is important for external validity), random assignment refers to how subjects are assigned to groups
Important for integrity of the experiment (internal validity)
Random assignment reduces the likelihood that groups differ in some critical way other than the treatment
Means Have Distributions
If you repeatedly draw samples, compute means, the means will form a distribution
As N increases, the distribution becomes more normal
This allows us to draw inferences about populations from samples drawn from those populations
Standard Error of the Mean
Variability between means if took repeated samples from the population
How much variability you’d expect to find if you took repeated samples from the same population
SD / sqrt[n] - denominator of t test
Standard error of mean allows inferences of populations
Larger n = smaller SEM
Larger SD = larger SEM
Dependent T-Test
Also called paired t-test, matched samples, repeated measures
The same participants give us data on two measures (e.g. before and after treatment)
More powerful
All calculations on difference scores
Value always higher than independent
Dependent T-Test Pros
Eliminate subject to subject variability
Control for extraneous variables
Need fewer subjects
Dependent T-Test Cons
Order/carry-over effects
Subjects no longer naive
Sometimes not logically possible
Variance sum law
Variance of the differences between two independent variables is equal to the sum of their variances