Data Collection Flashcards

Methods for data collection, sampling strategies

1
Q

What is probability theory?

A

The study of the mathematical rules that govern random events.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is an event?

A

A possible outcome of an experiment or observation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is descriptive statistics?

A

The use of statstics to summarise a set of known data in a clear and consise way.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is statstical inference?

A

The methods and practice of forming judgements about the parameters of a population, usually on the basis of random sampling.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is statistics used for?

A
  • designing experiments and other data collection
  • summarising info to aid understanding
  • drawing conclusions from data
  • estimating the present or predicting the future
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the general steps used in stats?

A
  1. pose a question, what are the objectives of the study?
  2. decide what to measure and how to measure it
  3. collect or generate data
  4. explore data, check for oddities
  5. calculate formal statistical summaries and carry out tests to answer posed question
  6. explore sensitivity of analysis to assumptions
  7. communicate findings
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the three general methods for collecting or generating data?

A
  • polls and surveys
  • experiments
  • observational studies
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a target population?

A

A collection of objects/individuals we want to learn something about.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a sample?

A

A subset of the target population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a census?

A

When information from the whole target population is obtained.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a poll or survey?

A

The process of collecting data from a sample in order to obtain info about the whole population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the cons of censuses?

A
  • difficult to make sure everyone participates
  • more expensive than surveys
  • takes longer
  • more practical to take survey
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a sampling error?

A

A single random sample will not give an exact “right answer”; it is unavoidable without taking a census.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How do we choose a sample?

A
  • each member of the population has an equal chance of being chosen
  • avoids bias
  • allows for the calculation of the likely size of sampling errors
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a variable?

A

Some characteristic of each individual in the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a parameter?

A

A numerical summary of a variable for a population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is a statistic?

A

A numerical summary of a variable for the sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is precision?

A

The value of the statistic is similar in all samples.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is a bias?

A

It implies that the sample statistic tends to differ from the population parameter in a consistant way (there is a systematic error).

20
Q

Describe two simple sampling strategies.

A
  1. number every member of the population, draw numbers from a hat
  2. calculate a fixed periodic interval:
    k = N/n
    where k is the inverval between successive samples, n is the sample size and N is the population size
    pick a random value between 1 and k (call this q)
    sample the qth individual, then the (q + k)th, then the (q + 2k)th, etc
21
Q

What are the benefits of systematic sampling over random sampling?

A
  • easier; only one random number has to be drawn

- BUT if population contains some periodic variation which lines up with k, sample will be biased

22
Q

What is stratified sampling?

A

The population is divided into different categories (or strata) and different samples are taken from each stratum.

23
Q

What is selection bias?

A

Introduced when the selected sample is not representive of the population of interest.

24
Q

What is self-selection bias?

A

Occurs when people decided whether they wish to participate in the survey or not.

25
Q

What is interviewer effects?

A

Different interviewers asking the same question can obtain different results.

26
Q

What is non-response bias?

A

When target population don’t respond and non-respondents tend to behave differently to repondents.

27
Q

What is question effects?

A

Subtle variation in question wording can have an effect on the responses.

28
Q

What is survey format? (As a reason for bias)

A

Results may be affected by factors such as question order or if the survey is conducted by mail, by phone or in person.

29
Q

What are behavioural considerations?

A

People tend to answer questions in a way they consider to be socially acceptable.

30
Q

What does transferring findings mean?

A

When results from one population are transferred to another.

31
Q

What is an experiment?

A

Used to try and discover whether a “treatment” or “condition” has an effect on individuals.

32
Q

What is a randomised experiment?

A

Treatments are randomly allocated to experimental units.

33
Q

What are subjects?

A

Experimental units that are human.

34
Q

What does partitioned mean? (Or, what are blocked experiments)

A

Individuals are assigned to groups (eg male/female), this reduces natural variability.

35
Q

What is a placebo?

A

A drug with no known effects.

36
Q

What is double blinding?

A

When the doctor doesn’t know which treatment they are offering, neither does the patient, stops doctors having differences in behaviour.

37
Q

What is a control group?

A

A group of units not given any form of treatment.

38
Q

How many times should experiments be repeated?

A

At least twice the number of treatments.

39
Q

What is an observational study?

A

No treatment given to subjects, instead naturally occuring events are recorded, used sometimes when its difficult to carry out a randomised experiment.

40
Q

What are the two types of observational study?

A
  • prospective (looking forward, choose samples now and follow up in the future)
  • retrospective (looking backward, choose samples and examine previous data)
    special case (case-control): seperate samples into cases and controls, look back and compare histories
41
Q

Can observational studies provide conclusive evidence for causation? What about a randomised experiment?

A

obs study - no

exp - possible to argue causation

42
Q

What is a confounding variable?

A

Some factor that’s not accounted for, introduces a difference in treatment groups, NOT due to treatment.

43
Q

Name two types of sampling for an area (of land for example).

A
  • plot sampling (completely random or systematic grid (which in turn can be random or systematic))
  • strip sampling (simple random or systematic sampling with random start)
44
Q

What is a response variable?

A

Variable/info we would like to describe or predict.

45
Q

What is an explanatory variable?

A

Covariates/info we expect might help us explain or predict the response variable.