Sampling and Probability Flashcards
Name three common sampling techniques.
self-selection
quota sampling
random sampling
Self-selection
participants sign up
- convenient
- might over-represent people with strong opinions etc.
Quota sampling
determine what proportions of people to sample based on whether they fit some pre-set constraints or categories
- might miss some important categories
- might miss participants who don’t fit pre-set categories
Random sample
selected from the population by a process that ensures each possible sample of a given size has an equal chance of being selected
- avoids sampling bias; should be representative
- sometimes impractical but best option
Sampling with or without replacement
with: same individual can be chosen twice (name can go back in before next draw)
without: individual can only be chosen once
Hypothesis testing
determine which process best accounts for the data
- systematic processes (rare)
- random processes (default assumption - simpler and more conservative)
- a combination
Randomness
variability that cannot be accounted for (prediction error) –> with certainty
may be due to some systematic process that we don’t understand yet or haven’t measured
- can say something about the relative likelihood of different outcomes: probability
Probability
p(A) = the probability of the occurrence of event A –> always falls between 0 and 1
if A is certain to occur, p(A) = 1
if A is certain not to occur, p(A) = 0
allows us to quantify uncertainty and use it to make predictions
a priori:
p(A) = # outcomes classifiable as event A [divided by] total # of possible outcomes
or a posteriori:
p(A) = # of times A occurred [divided by] total # of occurrences
Chances
probabilities expressed as percentages
Odds
probability that the event happens, compared to the probability that it doesn’t happen
Sample space
contains all possible outcomes of a random process (exhaustive set of events)
coin toss: {heads, tails}
die roll: {1, 2, 3, 4, 5, 6}
p(sample space) = 1
Event
possible outcome
Mutually exclusive events
events that cannot occur together
ie, one event’s occurrence precludes the other event’s simultaneous occurrence
p(A and B) = 0
when there are only two mutually exclusive outcomes, they can be “P” and “Q”
P = 1 - Q
A priori probability
before the fact (without experience)
theoretically derived
not based on collected data
p(A) = # outcomes classifiable as event A [divided by] total # of possible outcomes
A posteriori probability
after the fact (with experience)
empirically derived
based on collected data
p(A) = # of times A occurred [divided by] total # of occurrences