Chapter 4: Collecting Data Flashcards

1
Q

Explanation for an outcome

A
  1. by chance
  2. discrimination

–> run simulation to find convincing evidence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define Population

A

entire group of your interest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define Sample

A

subset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define Census

A

Collects data from the entire population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Types of Bad Sample

A
  1. Convenience samples
  2. Bias
  3. Voluntary Response Sample
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Define Convenience Samples

A
  • over/underestimate what you want to find from the population
  • introduces bias
  • produces samples that don’t reflect the population
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Define Bias

A
  • systematically favoring a certain outcome
  • when a study very likely to underestimate/overestimate what is being looked at
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define Voluntary Response Sample

A
  • made up of people who choose to answer a general appeal
  • usually people with strong emotions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Types of Good Sample

A
  1. Simple Random Sample
  2. Stratified Random Sample
  3. Cluster Sample
  4. Systematic Random Sample
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Define Simple Random Sample

A

An SRS of size __n__ is chosen so that every __GROUP__ of __n__ individuals has an equal chance to be selected as the sample

You must
1. numerically label the population
2. use technology to random digit table to get random numbers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Define Sampling WITH replacement

A

individual can be selected more than once. repeats allowed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Define Sampling WITHOUT replacement

A

individual cannot be selected more than once. repeats are ignored and not part of the sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How to select an SRS

A

put everything in a pile and pick randomly
- label [ex. 001-100, ignore 000 and 101-999]
- use random number generator [RandInt(1,100,# that you want)] to select # and context
- no repeats

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Define Stratified Random Sample

A

divide population into strata (homogeneous group) and chose an SRS from each group and combine.
- more precise

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Difference between Simple and Stratified Random Sample

A

Simple: Every individual in the population has an equal chance of being selected

Stratified: the population is divided into groups (strata) based on characteristic, and then a random sample is taken from each stratum

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How to selected a stratified random sample

A

explain your choice of strata
- explain the strata [ex. strata can be types of books]
- randomly select __same number__ of __context__ from each __strata context__

“Not every GROUP of the same size has the same chance of being picked”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Define Cluster Sample

A

create clusters (group) that “are located near each other”

randomly select a few clusters and include each member of the cluster.
- saves money and time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Difference between cluster and stratified random sample

A

cluster: the groups are heterogeneous since they are physically located near
stratified: you don’t select from every cluster

19
Q

How to select a systematic random sample

A
  1. calculation: (total amount)/(amount you want) = #
  2. pick a number randomly from 1 - # and add # each time.

ex. arrange all books in order. Randomly select 1 book from the first 40. Then choose every 40th after that.

20
Q

Define Systematic Random Sample

A

Your population is somehow ordered

randomly select one of the first k individuals and choosing every kth individual
- good when population is ordered
- easier to conduct

21
Q

What can go wrong

A
  1. undercoverage [members of the population have less of a chance of being chosen or left out]
  2. nonresponse [chosen individuals can’t be contacted or refuse to participate –> big issue]
  3. response bias [individuals lie or answer a question they don’t know]
  4. question wording bias [the way a question is worded or asked influences the response from an individual]
22
Q

Define Experiment

A

has treatment
need experiment to know CAUSATION

23
Q

Define Observational Study

A

No treatment
need observational study to know correlation

24
Q

Define Confounding Variable

A

other possible variables other than explanatory variable that affects response variable

25
Define Retrospective and Prospective
Retrospective = using existing data Prospective = tracks individuals into the future
26
Define Treatment
specific condition applied to the experimental unit
27
Define Experimental Unit/Subject
Person or thing treatment is randomly assigned to
28
Define Factor
Explanatory variable that is manipulated and may cause. change int the response variable.
29
Define Level
Different values of a factor
30
4 Principles of Experimental Design
1. comparison 2. random assignment 3. control 4. replication
31
Define Placebo Effect
when an individual responds to the dummy treatment
32
Purpose of Control Group
provides a baseline for comparison
33
Define single/double blind and their importance
Single-blind experiment = experimenters or subjects don't know what treatment they are getting or giving Double-blind experiment = both the subject and the experimenter do not know what treatment is given importance = blindness is important because it eliminates favoritism and/or placebo effect
34
Purpose of Random Assignment
1. create roughly equivalent groups 2. helps control confounding variables
35
How to randomly assign treatments to units
chance (not random sampling)
36
Define Replication
use enough subjects/experimental units so the outcome of the experiment can have meaning
37
Define Statistically Significant
the results most likely did not happen by chance convincing evidence (ran simulation)
38
Define Blocking
like stratified sample. experiment version. benefits: 1. controls confounding variables 2. increases chance of finding convincing evidence if the effect is real
39
How to determine to block
blocks should be homogeneous use a confounding variable
40
Define Matched Pairs Design
blocks only include 2 experimental units 1. 2 similar units 2. 1 unit that gets both treatment --> randomly assign treatments or randomly assign the order of treatment
41
Define Inference
using information from out sample/experiment to draw conclusions about the population
42
Define Sampling Variability
different samples from the same population will give different results
43
Define Margin of Error
"wiggle room" how far off our estimates is to the truth
44
completely randomized design for experiments
hexagon shaped diagram. LOOK AT THE NOTES