Chapter 13 - Inferential Statistics Flashcards
1
Q
inferential stats
A
- Use data collected on a sample to infer what is happening in the population
- Is the effect we found in our sample due to random chance, or due to a true effect in the population?
2
Q
why are we often wrong about inferential stats?
A
- We focus on random (rare) events
- Fail to use probabilistic information when making judgments
- Focus on specific rather than general information (Eg. base rate fallacy/”person-who” statistics - we know that smoking is bad, but someone will say “well my grandpa smokes a pack a day and he’s still healthy”)
- See patterns in randomness (Ex. Seeing pattern between mass shootings and violent video games -> although the shooters had a bunch of things in common (ie. Both brush teeth), we pay attention to commonality that we feel makes a pattern)
3
Q
random sample
A
- Taking random sample from population to estimate true effect
- As sample size decreases, estimate is less accurate of what really happens in the population
- Is the sample representative?
- samples are imperfect assessments of overall probabilities
- ex. In any fun-sized packet of Smarties, how many of each colour are there?
4
Q
population level
A
- ex. in the Smarties factory, how many of each colour are made?
5
Q
probabilistic trend
A
- What is the true overall effect?
- Not likely to be reflected in every sample and case
- The smaller the sample, the less likely you’ll see a probabilistic trend that reflects the overall effect
- ex. The expected proportion of each colour of Smarties in the population of Smarties at the factory
6
Q
large vs. small samples
A
- Small samples subject to more error in estimating population value
- Even with random assignment, problem still remains
- Random assignment works best with large sample sizes (chance plays a major role in statistical analysis and research methods)
7
Q
random sampling vs. random assignment
A
- Random sampling: when everyone in the population has an equal chance of being chosen as a participant (if you don’t do random sampling, bias can be introduced)
- Random assignment: when all participants have equal chance of being chosen to be in experiment or control condition
8
Q
directional null hypothesis
A
- assumes there is no effect on population and any difference is due to error -> has to account for everything your research hypothesis doesn’t account for
- Scientific notation: H0
- ex. Mean 1 is less than or equal to mean 2
- True effect: California label will not make wine taste better in population
- Says that random chance caused Mean 1 > Mean 2 in our sample
9
Q
directional research hypothesis
A
- assumes that the means are not equal in the population
- ex. Mean 1 > mean 2
- Scientific notation: H1 or HA
- True effect: California label makes wine taste better
- Random chance is very unlikely explanation that mean 1 > mean 2 in our sample
10
Q
non-directional research hypothesis
A
- assumes that the means are not equal in the population
- ex. Mean 1 doesn’t equal mean 2
- Scientific notation: H1 or HA
- True effect: label affects taste perception in the population
- Random chance is a very unlikely explanation of the difference between groups in our sample
11
Q
non-directional null hypothesis
A
- assumes there is no effect on population and any difference is due to error -> has to account for everything your research hypothesis doesn’t account for
- ex. Mean 1 equals mean 2
- Scientific notation: H0
- True effect: label doesn’t affect taste perception in the population
- Random chance caused any difference between groups in our sample
12
Q
when analyzing data, start by assuming that the ____ is true
A
- null hypothesis (meaning your results are due to random chance)
- as you go, figure out if you can reject it
- Whether something is due to chance is always the most parsimonious alternative explanation for any research finding
13
Q
basics of sampling distribution
A
- aka: binomial distribution or null hypothesis sampling distribution
- distribution of probability that a statistic would emerge in pop based on many previous samples -> what result would you get from data if null hypothesis is true and results are only due to random chance?
- Helps establish critical values -> threshold for determining significance
- ex. If you claim that you have magical powers to sense and pull out purple smarties from a box, what you need to know first is how many purple smarties other people would randomly pull? -> sampling distribution
- from there, you can use your sampling distribution to establish how many purple smarties one would have to pull out of a box to be magical -> critical value
- ex. if people randomly pick 4 purple smarties on average, you might say your critical value is 8 smarties
14
Q
how to calculate statistical significance
A
- Calculate a statistic that captures the effect (eg. chi square, t or f value, correlation, etc.) -> aka find obtained t value
- Refer to sampling distribution for comparison (what’s the expected statistic value for this sample size?) -> aka find critical t value
- Decide if our statistic value is rare enough to consider it significant -> if yes, reject null hypothesis and publish, if not, retain null hypothesis
15
Q
degrees of freedom (df)
A
- used to locate appropriate sampling distribution
- formula: N – 2 (total sample minus number of groups)
- ex. If N = 6, df = 4
- df is dependent on sample size, and more = better