Statistics Theory L2 = Study Design Basics Flashcards

1
Q

How do we design & execute a study to collect data and draw inferences as reliably as possible?

A

Through using a Population-sample-direction-of-inference diagram.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a Population-sample-direction-of-inference diagram?

A

= it is diagram used by scientists to direct how/what kind of inference to apply to their study.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Population-sample-direction-of-inference diagram attributes? (4)

A
  • Statistical population is a fluid concept (fluffy).
  • Sample is a concrete concept (box).
  • From sample to statistical population, you get statistics (induction).
  • From statistical population to sample, you get probability (deduction).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Goal of the Population-sample-direction-of-inference diagram?

A

To help us think in terms of estimating one or more parameters from our designated statistical population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How could we do this, i.e., what could we be estimating? (2)

A

Could be:

  • Estimating more than 1 parameter in a model.
  • Estimating a difference between treatments in a randomised experiment.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

So, what do we have to do to achieve our goal?

A

We have to fit a model with parameters using data that we have collected (and the data are measured in terms of variables).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Types of variables?

A

= the function they serve in answering our question or what effect they have on the outcome or conclusion.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Use of the types of variables?

A

They inform the type of analysis we choose.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Types of variables in terms of their function? (4)

A

Primarily, you have Explanatory & Disturbing variables. Under disturbing variables you have 2 kinds of variables namely, controlling & randomising variables.

So, essentially you have:

  • Explanatory variables.
  • Disturbing variables.
  • Controlling variables.
  • Randomising variables.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Explanatory variables?

A

= variables we need to answer the questions we are interested in.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Explanatory variables attributes? (4)

A
  • The focus of most scientific studies.
  • Represent the things we are actually interested in knowing something about.
  • Include x (predictor) and y (response) variables.
  • Focus on the function of the variables.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Predictor variable?

A

= variable that is purported by the hypothesis to cause the behaviour of the response variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Predictor variable attributes? (4)

A
  • Can be discrete or continuous.
  • Can be ordered & disordered.
  • If discrete, called a “factor”, as in the analysis of variance.
  • If continuous, called a “covariate”, as in studies that use regression analysis.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Response variable?

A

= variable of interest whose behaviour we want to predict (on the basis of our research hypothesis).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What choices does an investigator have in dealing with disturbing variables? (3)

A
  • Provide control for them.
  • Randomise them to remove/minimise their effect.
  • Ignore them.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Goals when using explanatory variables? (3)

A
  • Identify relationships between predictor & response variables.
  • Do this in the most unbiased & precise way as possible.
  • Very difficult in environmental science (esp. observational studies) due to the many external influences that we can’t control.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Disturbing/Extraneous variables?

A

= the things we need to control for or accommodate to get at the things we are interested in.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Disturbing varibales attributes? (2)

A
  • Represent extraneous influences that can affect the estimated relationship between explanatory variables, or our ability to measure them.
  • Can introduce bias into the study.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Eg of Disturbing variables?

A

Aerial game counts: we might be interested in animal abundance or distribution, but other factors get in the way of our ability to estimate those things accurately such as, detection probability of animals we count, observer experience & weather conditions on the day of the survey.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Types of Disturbing variables? (2)

A
  • Controlling.
  • Randomising.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Controlling variables?

A

= the subset of disturbing variables that we measure so that we can assess their effect on the variables & relationships of interest.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Controlling variables attributes? (4)

A
  • Best way to deal with disturbing variables.
  • We control for variability explained by a “nuissance”.
  • “Nuissance” removes bias, and
  • Improves precision.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Eg of Controlling variables?

A

Snake mass vs length, without & with sex variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Explain the Controlling variables example? (4)

A
  • Mass & length are the explanatory variables.
  • We know that the sex will affect the relationship between mass & length.
  • With controlling variable.
  • Without controlling variable.
25
Q

With controlling variable in Controlling variable example effects? (3)

A
  • Included sex (male & female) as a controlling variable.
  • Less bias for male & female.
  • End up with more precision (lower variability around each line).
26
Q

Without controlling variable in Controlling variable example effects? (3)

A
  • Combined data.
  • Biased slopes (too low for males; too high for females).
  • Less precise (higher variability).
27
Q

Randomised variables?

A

= the subset of disturbing variables that represent the factors whose influence we are aiming to reduce through the randomisation process.

28
Q

Randomising/Randomised variables attribute?

A

Cannot be controlled through design or measurement of controlling variable.

29
Q

Why do we use randomising? (3)

A
  • To reduce the effect of unknown influences on our study.
  • To deal with factors that are not practical to measure, or we don’t know how to measure them.
  • To accommodate limits imposed by sample size.
30
Q

Example of Randomised variables? (6)

A
  • Let’s say we’re studying the amount of weight loss in fat Americans as a result of the diet they are given (diet A, diet B, diet C).
  • X variable = Diet A, Diet B, Diet C.
  • Y variable = Amount of weight loss.
  • Possible influences on amount of weight loss is genetics.
  • So, we distribute the fat Americans in each diet group in the hopes that genetics (or other possible uncontrollable factors) will have a low influence over the results.
  • By doing this you’re randomising the variables.
31
Q

What are our goals in using Randomisation & Replication? (3)

A
  • To be objective/unbiased in the selection of the population subset to study.
  • To ensure that our inferences can be applied to the intended target population (i.e., the sample is representative of the statistical population).
  • To control confounding variables.
32
Q

Randomisation attributes? (4)

A
  • Incorporates a “chance mechanism”.
  • Aims for representation.
  • Random assignment of treatments to experimental units.
  • Consists of 2 facets, namely, random sampling & randomised experiment.
33
Q

What will happen if there’s no randomisation? (3)

A
  • Can’t claim that results apply to the statistical population/entire target population.
  • Can’t infer cause-and-effect.
  • Research is vulnerable to disturbing variables (due to the lack of probabilistic design that accompanies randomisation).
34
Q

Replication attributes? (3)

A
  • We want to measure >1 experimental unit.
  • Or, we want to apply treatment to >1 experimental unit.
  • More experimental units, more ability to detect an effect (& make inferences about the population studied).
35
Q

Conditions/Criteria that experimental units should meet for Replication? (2)

A

Experimental units should be:

  • Physically separate from each other (temporally or spatially).
  • Independent applications of a treatment.
36
Q

Consequence of experimental units not meeting the conditions/criteria?

A

We’re at risk of pseudo-replication.

37
Q

Pseudo-replication?

A

= involves treating non-independent experimental units as independent.

38
Q

Eg of Replication?

A

Imagine we have 2 lakes containing fish. We treat 1 lake with some chemical, and take a sample of 30 fish from each lake. How many replicates do we have?

39
Q

Why is Replication important?

A

To enable researchers to detect an effect in the experimental units.

40
Q

Types of study designs? (4)

A
  • Manipulative experiment.
  • Quasi-experiment.
  • Mensurative study.
  • Descriptive study.
41
Q

Manipulative experiment?

A

= the predictor (explanatory) variable is under the control of the researchers.

42
Q

Manipulative experiment attributes? (3)

A
  • Not necessarily in a lab.
  • Have random allocation of treatments & controls to experimental units.
  • Has replication.
43
Q

Quasi-experiment?

A

= environmental science studies where replicated, manipulative experiments are not feasible.

44
Q

Quasi-experiment attributes? (4)

A
  • Circumstances might require a sacrifice in terms of randomisation or replication.
  • Compromise our ability to make statistical inferences.
  • “Manipulation” is not necessarily under the control of researchers.
  • Impact assessments of development projects.
45
Q

Mensurative study?

A

= observational study where no researcher-controlled manipulation occurs.

46
Q

Mensurative study attributes? (5)

A
  • Random sampling is possible.
  • Generally no allocation of experimental units.
  • Comparisons take the place of experiments.
  • Findings are correlative.
  • Useful for evaluating patterns & processes within the population of interest.
47
Q

Descriptive study attributes? (4)

A
  • Provide description of a system of interest.
  • Not designed to answer “how” or “why” questions.
  • No replication, randomisation & comparison.
  • Increase our knowledge about the system of interest.
48
Q

Know the differences between what terms? (3)

A
  • Accuracy.
  • Precision.
  • Bias.
49
Q

Accuracy?

A

= a function of bias & precision.

50
Q

Precision?

A

= how close repeated estimates of the same parameter are to each other.

51
Q

Bias?

A

= the distance between an estimate of a parameter & the true value of that parameter.

52
Q

What is it that we want/aim for for our study design? (2)

A
  • High precision.
  • Low bias.
53
Q

High precision, low bias?

A

= repeated estimates are close together & close to the true value (to the truth).

54
Q

High precision, high bias?

A

= repeated estimates are close together & far from the true value.

55
Q

Low precision, low bias?

A

= repeated estimates are far from each other & close to the true value.

56
Q

Low precision, high bias?

A

= repeated estimates are far from each other & far from the true value.

57
Q

Control?

A

= an unmanipulated experimental unit.

58
Q

Control atributes? (2)

A
  • Are also treatments.
  • Benchmarks used to evaluate the impacts of treatments on the system response.
59
Q

Experimental unit VS Observational unit?

A

An experimental unit differs from an observational unit in that observational units can be samples taken from the experimental unit.