Targets 3A-3G Flashcards

1
Q

Population

A

The larger group we hope to learn something about

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Census

A

Collect data from all members of a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Sample

A

Subset of the population that actually gets examined or measured

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Inference

A

Drawing a conclusion about a population based on data from a sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Sampling Variability

A

We don’t expect the statistic from a sample to be the same as if we calculated it from the population; each sample produces a different result.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Random Sampling

A

typically produces a sample that is representative of the population

allows us to use the laws of probability to construct a margin of error

Larger ones give better information about a population than smaller ones

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Sample survey

A

A study that uses an organized plan to choose a sample that represents some specific population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Good sample

A

representative of the population and will provide a good estimate of the value of interest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Biased sample

A

Systematically over or under represents a portion of the population and would consistently overestimate or underestimate the value of interest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Random Sampling Process

A

Use a chance process to determine which members of a population are included in the sample

Sampling without replacement: a member of the population can only be selected once

Sampling with replacement: a member of the population can be selected more than once

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Sampling frame

A

the list of all the potential subjects in a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Chance processes

A

flipping a coin, rolling dice, drawing names out of a well-mixed hat, random number generator, random digit table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Random Sampling Benefits

A

is the number 1 way to fight bias. It avoids favoritism by the sampler and self-selection by the respondents.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Simple Random Sampling (SRS) Characteristics

A

Every group of n individuals in the population has an equal chance of being selected for the sample.

Equivalent to throwing all members of a population into a hat, mixing them up, and then drawing without replacement

NOT the only legitimate method of random sampling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Simple Random Sampling (SRS) Process

A

Choosing a sample of size n from a population of size N

  1. Assign random numbers between 1 and N to each member of the population
  2. Use randomness (calculator or table) to select n unique numbers. Be sure to state: ignore unused numbers; ignore repeats (for sampling without replacement)
  3. The subjects corresponding to the chosen numbers are in the sample.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Stratified Random Sampling

A

Split the sampling frame into homogeneous groups, then pick a random sample from each group.

Benefit :1 Reduces variability in the sample statistic (Field of Dreams Lab)

Benefit 2: Guarantees that all subgroups are represented in the sample; allows for subgroup comparisons

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Cluster Random Sampling

A

Split the sampling frame into heterogeneous clusters (members that are in close proximity to each

other), then randomly choose several clusters.

Sample ALL subjects in the randomly chosen clusters.

Benefit 1: Can make sampling less expensive by reducing travel time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Systematic Random Sampling

A

Begin with a randomly selected individual; every nth!” member after that will be in the sample.

Larger values of n generate smaller samples. Smaller values of n generate larger samples.

Benefit: Can be easier to use than an SRS especially if the population is lined up (or listed) already

If we want a sample of size 3, we’ll choose every 5th member of the population. (15 ÷ 3 = 5)

To begin, we choose a random number between 1 and 5. Our random number was 4, so the 4th

member is the first one for the sample. We also include every 5th member after that.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Biased Sample

A

Systematically over or under represents a portion of the population and would consistently overestimate or underestimate the value of interest.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Convenience Sample

A

select individuals from the population who are easy to reach

Example: A farmer brings a juice company several crates of oranges each week. A company inspector looks at

10 oranges from the top of each crate before deciding whether to buy all the oranges.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Voluntary Response Bias

A

happens when the sample is made up of subjects who chose to participate

Example: In 2016, Britain’s Natural Environment Research Council used an online poll to choose the name of

its new $300 million ship. The winning name “Boaty McBoatface” received 124,000 votes, far more than more

serious candidates “Shackleton”, “Endeavor”, and “Falcon”.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Undercoverage Bias

A

happens when a subgroup of the population has little to no chance of being chosen

Example: In 1936 legitimate polling organizations such as Gallup were just getting started. That same year the

magazine Literary Digest, used the phone book to send out a presidential poll. 2.4 million out of 10 million

mailed ballots were returned. 57% said they would vote for Alf Landon (Rep) and 43% said they would vote

for Franklin Roosevelt (Dem). In the actual election, Landon only won electoral votes from two states.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Response Bias / Question Wording bias

A

happens when the method of the survey (or the wording of the questions) influences the responses given by subjects

Example: How do the results from the Pierre, SD survey illustrate response bias?

24
Q

Nonresponse bias

A

happens when a portion of the chosen sample does not respond to the survey; the non-respondents may share a common characteristic such as being stressed or busy

Example: With our modern-day caller ID feature, many people do not answer their phone when a polling

organization calls.

25
Q

Observational Study

A

researchers observe their subjects

no treatment is imposed on the subjects

there is no control over other variables

vulnerable to confounding variables

26
Q

Retrospective observational study

A

data are collected from past events or records; could be unreliable if subjects must recall information

27
Q

Prospective observational study

A

subjects are identified at the beginning of the study and measurements are taken as the study unfolds; this could get expensive

28
Q

Explanatory variable

A

a variable that we think explains or causes changes in the response variable

29
Q

Response variable

A

the outcome being measured

30
Q

Confounding Variable

A

a variable that is related to the explanatory variable and influences the response variable. It may create a false perception of an association between the explanatory and response variables.

31
Q

Writing about a confounding variable

A
  1. Link the confounding variable to the explanatory variable.
  2. Link the confounding variable to the response variable.
32
Q

Experiment

A

researchers randomly assign treatments to the subjects

33
Q

Nature

A

Was a treatment randomly assigned to the subjects?

• No! We can only draw the conclusion that the two variables are associated.

• Yes! We can draw a cause-and-effect conclusion.

34
Q

Scope

A

Was the sample randomly chosen?

• No! We may not generalize our conclusions to the larger population.

• Yes! We may generalize our conclusions to the larger population.

35
Q

experimental unit / subject

A

the individuals to which treatments are applied

36
Q

explanatory variables / factor

A

variables whose levels are manipulated intentionally

37
Q

levels

A

dosages or durations of a factor

38
Q

treatments

A

the specific combinations of levels that are imposed on the groups of subjects

39
Q

placebo

A

a treatment that has no active ingredient but is otherwise like other treatments

40
Q

placebo effect

A

subjects show an improvement after receiving a placebo simply because they expect an

improvement

41
Q

blinding

A

keeping human participants unaware of the treatment applied in order to reduce response bias

42
Q

single blind study

A

subjects do not know which treatment they are receiving, but members of the

research team who interact with them do, or vice versa

43
Q

double blind study

A

neither the subjects nor the members of the research team who interact

with them know which treatment a subject is receiving

44
Q

inference

A

using the results from a sample to draw a conclusion about the population

The results of a study are considered to be statistically significant when the observed results are too unusual

to be explained by chance alone.

45
Q

Comparison

A

compare two or more treatment groups

46
Q

Random Assignment Benefits

A

creates roughly equivalent groups of subjects

to balance the effects of unknown variables; also reduces bias

47
Q

Control

A

control the levels of the explanatory factor and keep all other variables as similar as possible;

control prevents confounding

48
Q

Control Group

A

group that receives the currently accepted treatment or no treatment at all; this

counts as one of the levels

49
Q

Replication

A

apply the treatments to many subjects; also, run the study multiple times (in another location)

50
Q

Random Chance Experiments

A

We rely on random chance to create groups that resemble each other so that the only differences between the

groups are the treatments imposed.

If we find that one group’s results are better than the other’s we can be reasonably certain that the explanatory

variable caused the difference.

51
Q

Random Assignment of Treatments

A

If group sizes don’t need to be equal we can use a coin flip, roll dice or use a random digit table,

allowing repeats.

If group sizes must be equal we can write treatments on slips of paper or use a random digit table,

ignoring repeats.

Don’t flip a coin until one group is filled. Putting the rest of the subjects in the other group could

unintentionally create a confounding commonality in that group.

52
Q

Describing a Completely Randomized Experiment

A
  1. Begin with a group of subjects.
  2. Randomly assign subjects to the treatment groups. State the treatments in context and explicitly link

each random outcome to a treatment.

  1. State the response variable in context that will be compared at the end of the experiment.
53
Q

Randomized Block Experiment

A

The pool of subjects is purposely divided into blocks of homogenous subjects.

Within each block subjects are randomly assigned to treatments.

At the end of the experiment the measurements from each treatment group will be compared to the other treatment groups in the same block

We block on variables that we believe will affect the response variable so that the only differences between the treatment groups are the treatments

Blocking reduces variability in the results (much like stratification in sampling)

54
Q

Statistics Haiku Rule 1

A

Control what you can - make the circumstances as similar as possible for all subjects, except for the explanatory variable – reduce confounding

55
Q

Statistics Haiku Rule 2

A

Block on what you can’t control - if there is a variable that is suspected to affect the response variable, block on that variable to reduce confounding

56
Q

Statistics Haiku Rule 3

A

Randomize the rest - Random assignment creates similar treatment groups. The effects of any unknown confounding variable should be minimized if it is equally represented in all groups

57
Q

Matched Pair Design

A

The pool of subjects is purposely divided into blocks of size two (pairs)

Two subjects who are as alike as possible have treatments randomly assigned to them.

One subject provides two measurements with the order of treatments randomly determined

At the end of the experiment the differences between the 2 measurements will be analyzed

matched pair studies are very powerful because they can control for many many variables