Exploring quantitative data Flashcards

1
Q

What is the research process?

A

A process that uses the scientific method to establish, confirm and/or reaffirm certain pieces of knowledge supported by strong evidence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why do we use the research process?

A
  • Create theories
  • Find solutions to problems
  • Find problems to solutions
  • Find some sort of truth
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does the research design encompass?

A

Plan of sampling, data collection, measurement and analysis

Methodology
Study design
Data analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Types of study designs in medicine

A
  1. Basic studies
  2. Observational studies
  3. Experimental (interventional) studies
  4. Economic evaluations
  5. Meta-analysis/systematic review
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is an observational study?

A
  • Non-interventional
  • variables are not manipulated by researchers
  • Researchers observe natural relationships between factors and outcomes
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Types of observational studies

A
  1. Cross-sectional studies
  2. Longitudinal studies
  3. Case-control studies
  4. Cohort studies
  5. Survey studies
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a Cross-sectional study?

A

A study that assesses a population, as represented by the study sample, as a single point in time
–> they reflect the situation of a disease or clinical outcome at a particular moment in a particular population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Cross-sectional study example

A
  • Enrolling current smokers or never smokers and assessing whether or not they have decreased lung function
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How is a cross-sectional study conducted?

A
  1. Participants recruited based on inclusion and exclusion criteria
  2. Study the exposure and outcome at the same time
  3. Estimate the prevalence (of outcome and exposure as well) –> calculate odds ratio
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a longitudinal study?

A

They use continuous or repeated measures to follow a particular individual over prolonged periods of time (year or decades)

  • -> observational in nature
  • -> collects quantitative and qualitative data without external influence being applied
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Longitudinal study example

A

The longitudinal study of the Office of Population Censuses and Surveys

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a case-control study?

A

Researchers identify study participants based on their case status (i.e. diseased or not diseased)
–> Quantification of the number of individuals among the cases and the controls who are exposed allows for statistical associations between exposure and outcomes to be established

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Case-control study example

A

Analysing the relationship between obesity and knee replacement surgery
–> Cases are participants who have had knee surgery, and controls are a random sampling of those who have not, and the comparison is the relative odds of being obese if you have knee surgery as compared to those that do not

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are cohort studies?

A

involve identifying study participants based on their exposure status and following them through time to identify which participants develop the outcome(s) of interest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Cohort study example

A

cohort of 5766 men aged 35–64 at the time of examination who were recruited from workplaces in the west of Scotland between 1970 and 1973. The study investigated the association between socioeconomic position in early life (when the participants were children) and cause specific mortality. Relative rates of mortality adjusted for age for men with fathers in manual versus non-manual occupations were 1.52 (95% confidence interval 1.24 to 1.87) for coronary heart disease, 1.83 (1.13 to 2.94) for stroke, 1.65 (1.12 to 2.43) for lung cancer, 2.06 (0.93 to 4.57) for stomach cancer, and 2.01 (1.17 to 3.48) for respiratory disease.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are survey studies?

A

Research based off of surveys

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Benefit of surveys

A

enables the researcher to describe the characteristics of the sample being studied and to make generalisations to the larger population of interest. Surveys are particularly useful for collecting information about research phenomena that are not directly observable or measurable. They are also useful for collecting data from people who are widely distributed geographically, since direct contact between researcher and research participant is not necessary.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Types of surveys

A
  1. Epidemiological
  2. Surveys on attitudes to a health service intervention
  3. Questionnaires assessing knowledge on a particular topic or issue
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What are interventional studies?

A

those where the researcher intervenes or manipulate the variable (s) at some point throughout the study.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Types of interventional studies

A
  1. RCT
  2. Pre-post studies
  3. Non-randomised controlled trials
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is an RCT?

A

a trial in which subjects are randomly assigned to one of two (or more) groups: one (the experimental group) receiving the intervention that is being tested, and the other (the comparison group or control) receiving an alternative (conventional) treatment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Features of an RCT

A
  • The sample to be studied will be appropriate to the hypothesis being tested so that any results are appropriately generalisable. The study will recruit sufficient patients to allow it to have a high probability of detecting a clinically important difference between treatments
  • There will be effective (concealed) randomisation to eliminate bias
  • Both groups will be treated identically in all respects except for the intervention being tested and to this end, patients and investigators will ideally by blinded to which group an individual is assigned
  • The investigator assessing outcome will be blinded to treatment allocation
  • Patients are analyzed within the group to which they were allocated, irrespective of whether they experience the intended intervention (intention to treat analysis)
  • The analysis focuses on testing the research question that initially led to the trial
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What are pre-post studies?

A

A pre-post study measures the occurrence of an outcome before and again after a particular intervention is implemented.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Example of pre-post study

A

comparing deaths from motor vehicle crashes before and after the enforcement of a seat-belt law.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What are the non-randomised trials?

A

interventional study designs that compare a group where intervention was performed with a group where there was no intervention –> suggest possible relationships between the intervention and the outcome.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Negative of non-randomised controlled trials

A

Often subject to bias or error

27
Q

Why do we want to visualise data?

A

to visualise the distributions of the variables and relationships between them. This allows you to become familiar with the data before carrying out analysis, reveal possible data entry errors, and discover unexpected patterns in the data.

28
Q

3 aspects of distribution

A
  1. Location
  2. Spread
  3. Shape
29
Q

What is the location of the variability?

A

A typical value taken by the variable

30
Q

What is the spread of the variability?

A

How far the values extend from the centre

31
Q

What is the shape of the variability?

A

whether or not values are spread symmetrically on either side of the centre

32
Q

Continuous variables can be explored using plots such as…

A
  • Dot plots
  • Histograms
  • Density plots
33
Q

What are dot plots?

A
  • Draw an axis

- Put a dot for each response

34
Q

What is alpha blending?

A

giving dots transparency so that you can see where points are overlapping

35
Q

What is a jittered dot plot?

A
  • The vertical axis has no meaning except as a way to separate the dots –> can see more detail and clusters
36
Q

What are dot plots useful for?

A
  • Comparing the distribution between groups
37
Q

Negative of dot plots

A
  • Hard when you have lots of points
38
Q

What are histograms useful for?

A
  • Showing shape of the distribution
  • Visualising large numbers of observations
  • Provide a good picture of the location, spread and shape
39
Q

How to make a histogram

A
  • Divide the range of height into bins then count how many of the values fell into each bin
40
Q

Details of skewed data

A
  • Look at the tail of the data

- can be skewed to the right (positively skewed) or the left

41
Q

Negative of histograms

A
  • Don’t have a lot of control over them since you only have discrete choices for the number of bins
  • Changing the number of bins can give quite a different picture
  • They are somewhat subjective (based on the choice of bins)
  • Difficult to compare more than two groups
42
Q

What is a bimodal distribution?

A

When the distribution has two peaks

43
Q

What are density plots?

A
  • Alternative to histograms that show a continuous estimate of density
44
Q

What are density plots useful for?

A
  • Comparing distributions between groups since it is easy to overlay them
45
Q

Negative of density plots

A

Need software to create them (whereas histograms can be calculated and drawn by hand if needed)

46
Q

What is the median?

A

Lining up all the values in order from smallest to largest and seeing what is in the middle

47
Q

Other names for the median

A

50th percentile

0.5 quantile

48
Q

What is the 25th percentile (0.25 quartile) called?

A

the FIRST QUARTILE

49
Q

What is the 75th percentile (0.75 quartile) called?

A

the THIRD QUARTILE

50
Q

What is the 0th percentile (0.00 quantile) called?

A

the MINIMUM

51
Q

What is the 100th percentile (1.00 quartile) called?

A

the MAXIMUM

52
Q

How to estimate first quartile

A

Median of values BELOW the median

53
Q

How to estimate third quartile

A

Median of values ABOVE the median

54
Q

What is the five-number summary?

A

Minimum, first quartile, median, third quartile, maximum

55
Q

What is the interquartile range?

A

Distance between the first and third quartiles
E.g. first quartile = 51; third = 65
therefore the interquartile range = 65-51
= 14

56
Q

Positive of IQR

A

It is not affected by the tails of the distribution

57
Q

What is a box and whisker plot

A

Shows the location, spread and shape of the distribution as well as flagging unusual observations

Box = Q3, M & Q1 
Whiskers = Min & Max
58
Q

How can data be flagged as unusual?

A

If they are 1.5 times higher or lower than the Q3 and Q1

59
Q

What are boxplots useful for?

A
  • Comparing distribution
60
Q

What is an outlier?

A

A data value that does not seem to match the overall distribution observed
- Can be genuine, or made from experimental errors or mistakes in data entry

61
Q

Categorical variables (nominal or ordinal) such as gender or degree program are most often graphed using…

A

a BAR CHART

62
Q

What do bar charts show?

A

frequencies in each category or the percentage of participants in each category by using different bar heights (for a vertical bar graph) or lengths (for a horizontal bar graph)

63
Q

What are bar charts useful for?

A

comparing frequencies across a range of categories