Week 1 Flashcards

1
Q

Five stages of statistical process.

A

Hypothesis - Question of interest
Study Design - how to collect information
Collect Data
Analyse Date - descriptive statistics (understanding), inference (modelling)
Present Results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the six principles of statistics?

A

-collection
-classification
-organisation
-analysis
-interpretation
-presentation of information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Explanatory variable.

A

Piece of information that we are interested in. We observe differences in the explanatory variable and notice whether they are related to the response variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define population.

A

Entire set of objects of interest.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define sample.

A

Subset of the set of objects of interest.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is statistical inference?

A

Analysing a sample from a population to make inferences about a population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a designed experiment? Give an example.

A

Researcher exerts control over the experimental units. Researcher gives medicine to some patients placebo to others.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is an observational study? Give an example.

A

Researcher observes the experimental units and records the variables of interest. E.g. To assess study habits, students are asked how they prepared for an exam and then their grades are compared.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is quantitative data?

A

Measurements that are recorded on a natural numerical scale.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is continuous data?

A

Quantitative Data; measurements of which can fall anywhere on the real line.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is discrete data?

A

Quantitative data; measurements of which can only take one of a finite set of values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is qualitative data?

A

Measurements that cannot be recorded on a natural numerical scale.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is nominal data?

A

Is qualitative data with no meaningful ordering.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is ordinal data?

A

Is qualitative data with an inherent order.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does a barchart represent?

A

Shows the distribution of categorical values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does a histogram represent?

A

Shows the distribution of numerical values.

17
Q

What does a scatterplot represent?

A

Shows the relationship between variables.

18
Q

What are bins?

A

Frequency on the x-axis in a histogram representation.

19
Q

What are the measures of location? (4)

A

Mean
Median
Mode
Percentiles

20
Q

What are the measures of spread?

A

Variance
Standard Deviation
Range
IQR

21
Q

Explain box plots.

A

Median is the horizontal line in the box.
Top of box is 3rd Q
Bottom of box is 1st Q
Arms stretch to 1.5 times 1Q and 3Q respectively.
Dots outside arms represent outliers.

22
Q

Define standard deviation.

A

Records the root mean squared deviation of values from the mean.

23
Q

What is the relationship between sample standard deviation and sample variance?

A

sample variance = (sample standard deviation) ^2

24
Q

Formula for sample s.d.

A

s = root(1/n-1 * (sum (xi - (Xbar)^2)))

where xi is each value. Xbar is sample mean. n is sample size.

25
Q

Formula for s.d.

A

s = root(1/N * (sum (xi - (X)^2)))

where xi is each value. X is population mean. N is population size.

26
Q

Formula for sample variance.

A

s = 1/n-1 * (sum (xi - (Xbar)^2)))

where xi is each value. Xbar is sample mean. n is sample size.