Data and Graphical Summaries Flashcards

1
Q

Observational studies

A

The investigators have no control over the subjects or quantities of interest; they are just observers.
The investigators cannot use randomisation for allocation into groups.

2
Q

Association

A

Two variables are linked: changes in one tend to go together with changes in the other.
Association is relatively easy to establish from data.

3
Q

Does association prove causation?

A

No, it does not. It may only suggest causation.

4
Q

Precautions in observational studies

A

(I) It is very difficult to establish causation.
(II) Observational studies with a confounding variable can lead to Simpson's paradox.
(III) Historical controls can mislead, because time may act as a confounding variable.

5
Q

Control

A

A subject who did not receive the treatment.

6
Q

Controlling for confounders

A

Trying to reduce the influence of confounding variables on the comparison of interest.

7
Q

Contemporaneous control

A

A control studied in the same time period as the treatment group. Controlled experiments need the treatment and control groups to be observed in the same time period.

8
Q

Causation

A

The relationship between cause and effect: a change in one variable produces a change in another.

9
Q

Historical control

A

A control drawn from an earlier time period than the treatment group. Time can then be a confounding variable, so a contemporaneous control is preferred.

10
Q

Simpson’s paradox

A

The association between a pair of variables (X, Y) reverses sign upon conditioning on a third variable Z, regardless of the value taken by Z.

Sometimes there is a clear trend in individual groups of data that reverses when the groups are pooled together.
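
The reversal can be checked with simple arithmetic. The sketch below is only an illustration: the counts are illustrative figures chosen for this example, the group labels "small" and "large" stand in for the values of Z, and the helper names `rate` and `pooled_rate` are hypothetical.

```python
# Illustrative counts as (successes, total) for treatments A and B,
# split by a third variable Z with values "small" and "large".
counts = {
    "small": {"A": (81, 87), "B": (234, 270)},
    "large": {"A": (192, 263), "B": (55, 80)},
}


def rate(successes, total):
    """Success rate within a single group."""
    return successes / total


def pooled_rate(treatment):
    """Success rate after pooling all groups together (ignoring Z)."""
    successes = sum(counts[g][treatment][0] for g in counts)
    total = sum(counts[g][treatment][1] for g in counts)
    return successes / total


# Conditioning on Z: A has the higher rate in every group.
a_wins_within = all(
    rate(*counts[g]["A"]) > rate(*counts[g]["B"]) for g in counts
)

# Pooling over Z: B has the higher rate overall.
b_wins_pooled = pooled_rate("B") > pooled_rate("A")

for g in counts:
    print(g, round(rate(*counts[g]["A"]), 3), round(rate(*counts[g]["B"]), 3))
print("pooled", round(pooled_rate("A"), 3), round(pooled_rate("B"), 3))
```

In these counts A is mostly applied to the "large" group, where success is less likely for both treatments, and that imbalance is what drives the pooled comparison in the opposite direction.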

11
Q

IDA - Initial data analysis

A

First general look at the data, without formally answering the research questions
Helps you see whether the data can answer research questions
Identify the data’s main qualities
Suggest the population from which a sample derives

12
Q

IDA process

A

Data background: checking quality and integrity of the data
Data structure: what info has been collected?
Data wrangling: scraping, cleaning, tidying, reshaping, splitting, combining
Data summaries: graphical and numerical
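
The four steps above can be sketched on a toy dataset. This is a minimal illustration using only the Python standard library; the records, the variable names (`id`, `group`, `height_cm`) and the choice to drop incomplete records are all assumptions made for the example, not part of the course material.

```python
import statistics

# Hypothetical raw records; None marks a missing measurement.
raw = [
    {"id": 1, "group": "treatment", "height_cm": 172.0},
    {"id": 2, "group": "control", "height_cm": None},
    {"id": 3, "group": "control", "height_cm": 165.5},
    {"id": 4, "group": "treatment", "height_cm": 180.5},
]

# Data structure: which variables were collected, and for how many subjects?
variables = sorted(raw[0].keys())
n_subjects = len(raw)

# Data background: count missing values per variable as a basic integrity check.
missing = {v: sum(r[v] is None for r in raw) for v in variables}

# Data wrangling: keep only complete records (one possible cleaning choice).
clean = [r for r in raw if all(r[v] is not None for v in variables)]

# Data summaries: a numerical summary of one quantitative variable.
heights = [r["height_cm"] for r in clean]
summary = {
    "n": len(heights),
    "mean": statistics.mean(heights),
    "median": statistics.median(heights),
}

print(variables, n_subjects, missing)
print(summary)
```

Dropping incomplete records is only one way to handle missing values; whether it is appropriate depends on why the values are missing.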

13
Q

Variables

A

A variable measures or describes some attribute of the subjects.
Data with p variables is said to have dimension p.

14
Q

Graphical summaries

A

Choose the graphical summary that best highlights the features of the data.
To some extent, finding it is a matter of trial and error.

15
Q

Big data

A

Refers to massive amounts of data being collected
High dimensional: more variables than subjects
Requires more complex visualisations, complicated machinery
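
The "more variables than subjects" condition can be written as a comparison between the dimension p and the number of subjects n. The sketch below assumes each subject is stored as a dictionary of variable names to values; the function names and the `gene_*` variables are hypothetical.

```python
def dimension(records):
    """Number of variables p: the count of distinct variable names."""
    return len({name for record in records for name in record})


def is_high_dimensional(records):
    """True when there are more variables (p) than subjects (n)."""
    return dimension(records) > len(records)


# Two subjects (n = 2), three variables each (p = 3).
records = [
    {"gene_1": 0.2, "gene_2": 1.4, "gene_3": 0.9},
    {"gene_1": 0.5, "gene_2": 1.1, "gene_3": 0.7},
]

print(dimension(records), len(records), is_high_dimensional(records))
```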

16
Q

Data

A

Information about the set of subjects being studied.