Quizzes Flashcards

1
Q

Probability is the process of drawing conclusions about the sample based on population data.
T/F

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Big Picture Statistics order

A

1-Exploratory data analysis
2-Produce data
3-Probability
4-Inference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Multistage Sampling

A

In statistics, multistage sampling is the taking of samples in stages using smaller and smaller sampling units at each stage. Multistage sampling can be a complex form of cluster sampling because it is a type of sampling which involves dividing the population into groups.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Stratified Sampling Vs Cluster Sampling

A

Cluster sampling divides a population into groups, then includes all members of some randomly chosen groups. Stratified sampling divides a population into groups, then includes some members of all of the groups.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Observational Study vs Experiment

A

Observational Study: Individuals choose which treatment to receive or naturally belong to one of the treatment groups
Lurking variables that influence choice confounded with treatments
Passive data collection: observing, measuring, counting, subjects are undisturbed
Media often improperly attribute cause-effect conclusions to these

Experiment: A study design where treatments are imposed on subjects before observing response (manipulations, interventions)
To determine if treatments cause change in response

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Quantitive vs. Categorical

A

Quantitive variable : variable whose possible values are meaningful numbers : cost, height, yield
Categorical variable: variable whose possible values are non-quantitative categories ex; gender, option

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Lurking Variables

Control

Confounding

A

Lurking variables: variable that is related to the explanatory variable and the response variable and could affect your interpretation of the relationship among those variables
Control: an effort to reduce effects of lurking variables
Confounding: situation in which effects of lurking variable cannot be distinguished from effects of factors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Under-coverage
vs
Non-response
vs
Misleading response

A

Under-coverage
Some individuals have no possibility of being selected
Ex: homeless, phoneless

Non-response
Some individuals refuse to answer or can’t be contacted
Ex: hang-ups, on vacation, refusal to mail census form

Misleading response
Selected individuals lie or give inaccurate answer (sensitive issues)
Ex: do you wash your hands, have you cheated, do you struggle with mental health?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Response variable
vs
Explanatory variable

A

A response variable is the expected effect, and it responds to other variables.
The response variable is measured on the individual.

An explanatory variable is the expected cause, and it explains the results.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Control / Comparison

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Benefit of Experiment

A

An experiment can be used to establish causation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Randomized Block Experiment
vs
Randomized Controlled Experiment
vs
Matched Pairs

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the advantage of a histogram over a stemplot or dotplot?

A

Histograms work well for very large data sets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Mean vs Median
3 steps

A

1.first construct histogram or stem plot, evaluate skewness and outliers
2.use median if markedly skewed or outliers are present
3.use mean if roughly symmetric

*Approximately equal if histogram is roughly symmetric
*Median “resistant” to outliers and long tails
*Mean has desirable properties for inference (much more on this later)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Mean + Median and Histogram

A

x(bar)= mean
M= median
symmetric distribution: x=M
left skew: x<M
right skew: x>M

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Random Variable
vs
Continuous Random Variable
vs
Discrete Random Variable

A

characteristic that is measured on each individual; e.g., cost, height, yield, sex

variable that can take on any value in an interval so that all possible values cannot be listed; e.g., time, height, temperature

variable whose possible values are a list of distinct values; e.g., sex, opinion, # of arrests, shoe size