Ch5: Sampling and Probability Flashcards

1
Q

The goal of sampling is simple:

Samples and their populations

A

collect a sample that represents the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

2 main types of sample:

Samples and their populations

A
  • Random
  • Convenience
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Pros of random sampling:

Samples and their populations

A
  • Ideal
  • More representative
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Cons of random sampling:

Samples and their populations

A
  • Expensive
  • Results in lots of practical problems
  • Almost impossible to obtain
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Convenience sampling - easier to grab samples from a more local portion of the population, but this comes with a significant downside:

Samples and their populations

A

Affects generalizability: refers to researcher’s ability to apply findings from one sample or in one context to other samples or contexts; AKA external validity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

While external validity is affected, how can we improve the internal validity of convenience sampling?

Samples and their populations

A

Replication: the duplication of scientific results, ideally in a different context or with a sample that has different characteristics (sometimes called reproducibility); AKA just doing the study again and again

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When must we be even more cautious than using convenience samples?

A

Using self-selected/volunteer samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why are volunteer samples becoming more popular?

A
  • Crowdsourcing: occurs when a research team solicits input from a very large group of people, usually recruited online
  • On one hand, this is enabled by the internet
  • On the other, we must be cautious - collecting data through crowdsourcing
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Mturk - what is it and its downsides?

A
  • Mturk (Amazon Mechanical Turk) - an online website where anyone can recruit people to complete tasks for a small fee
  • While convenient, the limitations of using such volunteer sampling remain (EX - biases)
  • Also subject to fraud - upwards trend of random responses (including randomly selected responses and numbers)
  • Ethical concerns arise with respect to participants, often called Turkers: some note that their hourly pay amounts to less than study participants who are paid through other means, and is less than the minimum wage in countries such as the US and Canada
  • Some say that the online nature of Mturk may not protect their privacy/anonymity
  • For those who use Turkers in their work, there’s now guidelines for being an ethical researcher
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Random assignment = a distinctive signature of a scientific study; WHY?

A

It evens the levels of the playing field when every participant has an equal chance of being assigned to any level of the IV

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why is probability central to inferential stats?

A

Because our conclusions about a population are based on data collected from a sample rather than on anecdotes and testimonials

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Two personal biases get intertwined in our day to day thinking:

A
  • Confirmation bias: usually unintentional tendency to pay attention to evidence that confirms what we already believe and to ignore evidence that would disconfirm our beliefs
  • Illusory correlations: the phenomenon of believing one sees an association between variables when no such association exists
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When we discuss probability in everyday conversation, we tend to think of what statisticians call…

A
  • Personal probability: a person’s own judgement about the likelihood that an event will occur, also called subjective probability; really just our best guess
  • EX: “there’s a 75% chance I’ll finish my paper tonight!”
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Statisticians are concerned with a different type of probability

A

Probability: the likelihood that a particular outcome - out of all possible outcomes - will occur

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

In stats, we’re interested in a more specific definition of probability…

A

Expected relative-frequency probability: the likelihood of an event occurring, based on the actual outcome of many, many tries

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

In reference to probability - trial, outcome, success

A
  • Trial: refers to each occasion that a given procedure is carried out
  • Outcome: refers to the result of a trial
  • Success: the outcome for which we’re trying to determine the probability
  • Thus, probability = successes / trials
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Probability, proportion, percentage

A
  • Probability - the proportion that we expect to see in the long run
  • Proportion - the number of successes divided by the number of trials
  • Percentage - probability or proportion multiplied by 100
18
Q

One of the central characteristics of expected relative-frequency probability

A

IT ONLY WORKS IN THE LONG RUN (REFERRED TO AS THE “LAW OF LARGE NUMBERS”)

19
Q

To avoid bias, statistical probability requires that…

Independence and probability

A

…the individual trials be independent - as in that the outcome of each trial must not depend/rely in any way on the outcome of previous trials

20
Q

3 steps to developing hypothesis:

A
  1. Establish control/experimental group
  2. Development of the hypotheses to be tested
  3. Making a decision about the hypothesis
21
Q

When we calculate inferential stats, we’re actually comparing two hypotheses:

A
  • Null hypothesis
  • Research hypothesis
22
Q

Null hypothesis

A
  • A statement that postulates that there is no difference between populations or that the difference is in a direction opposite to that anticipated by the researcher (like > or = to, or < or = to)
  • Can think of as the boring hypothesis, because it proposes that nothing will happen
23
Q

Research Hypothesis

A
  • A statement that postulates a difference between populations
  • Can specify a direction
  • NOTE: important to state the comparison group (i.e. - the group viewing a photo with healthy crackers has a higher avg. calorie estimate, INSTEAD: the group viewing a photo with healthy crackers has a higher avg. calorie estimate THAN the group that views the photo without the healthy crackers)
24
Q

How to we compare the null vs. research hypothesis to determine probability?

A
  • We formulate the null hypothesis and the research hypothesis to set them up against each other
  • We use stats to determine the probability that there is a large enough difference between the means of the samples that we can conclude there’s likely a difference between the means of the underlying populations
  • So, probability plays into the decision we make about the hypotheses
25
Q

When we make a conclusion at the end of a study, the data lead us to conclude one of two things

A
  • We decide to reject the null hypothesis
  • We decide to fail to reject the null hypothesis
26
Q

When data between the null and research hypothesis DO state a difference…

A

we REJECT the null hypothesis; the idea that there is NO mean difference

27
Q

When data between the null and research hypothesis DO NOT state a difference…

A

When data do not suggest a difference, we FAIL TO REJECT the null hypothesis - which is that there is no mean difference

28
Q

WORD TO NOT USE IN HYPOTHESIS TESTING!!!!

A
  • “accept” (rather, in terms of reject)
  • Reject the null hypothesis
  • Fail to reject the null hypothesis
29
Q

What can unrepresentative samples cause?

A

Wrong decisions about accepting/rejecting the hypothesis

30
Q

Type I error

A
  • when we reject the null hypothesis, but the null hypothesis is correct
31
Q

Detriments of a Type I error

A
  • The rejection of the null hypothesis typically leads to action, until we discover that there was an error
  • Many researchers consider the effects of a type I error to be detrimental - particularly because people often take action on a certain finding, but actually based on a mistaken finding
32
Q

Common trends in published research regarding Type I error

A
  • Positive outcomes are more likely to be reported than null results
  • 1: researchers less willing to publish null results
  • 2: journals tend to publish “exciting” results rather than boring ones
  • 3: mass media compounds this problem; entertains science that can be interpreted as gossip
33
Q

Type II

A
  • when we fail to reject the null hypothesis, but the null hypothesis is false
  • Overall - when we incorrectly fail to reject the null hypothesis
  • Results in a failure to take action - EX: if a research intervention is not performed or a diagnosis is not given
34
Q

LECTURE

35
Q

Sampling distributions help us model…

A

the frequency of all possible outcomes by drawing lots of samples

36
Q

What is the notation for Null Hypotheses?

A

H2 (subscript)

37
Q

What is the notation for Research Hypotheses?

A

H1 (subscript)

38
Q

Research hypothesis: what can explain its effects?

A

We are unlikely to have gotten this effect if our sample if only random chance was operating

39
Q

Null hypothesis: what can explain its effects?

A

Random chance is likely an explanation for any effect we see in the sample

40
Q

What are the 3 steps in defining results?

A
  1. We have our hypotheses about the population
  2. We collected data on a sample
  3. Now…which hypothesis is the most consistent with our findings? (done with INFERENTIAL STATS)
41
Q

4 steps from results to hypothesis:

A
  1. describe research/null hypothesis
  2. calculate descriptive stats
  3. choose the best statistical stats
  4. make an inference about the population
42
Q

What statistic can tell us about the status of the null hypothesis (whether we can reject it or not)? What’s the ideal value?

A

Can reject the null when p < .05