Data Analytics Flashcards

1
Q

Simple Barchart Function

A

Distribution of 1 categorical variable (eg sex % of total population)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Simple Histogram Function

A

Distribution of 1 continuous variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Dot Plot/ Composite boxplot

A

Association between 1 categorical and 1 continuous variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Simple Scatter plot (both regression and smooth line)

A

2 continuous variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Composite and Stacked Barchart Function

A

2 categorical variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Realistations Def.

A

Measured observed points from population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Frequency Distributions Def.

A

Ordered display of each value in data set together with how often it appears in data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Relative Frequencies Def.

A

% of sample points that have a particular value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why do bar graphs fail

A

There’s a zone of irrelevance and data can be obscured

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Left Skewed Def.

A

Most data sits on left. Tail falls to right

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Right skewed de.

A

Most data sits on rate. Tail falls left

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Which value is greater in left skewed distribution (when not equal)

A

mean < median < mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Which value is greater in right skewed distribution (when not equal)

A

Mean > median > mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Variance Def.

A

Averaged squared difference from mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Standard Deviation Def.

A

Average distance of scores to mean (square root of variance)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How To Choose Between sd and IQR

A

Which is closer to range

17
Q

Pearson’s Correlation: R value Outline

A

Analyse the association between 2 continuous variables. The closer to 1 or -1 the more linear the association. Th closer to 0 there’s no association

18
Q

Relative Risk Outline

A

Measures magnitude and direction of 2 categorical variables association. Interchangeable with chance, probability and proportion. Use relative % (% being observed in group’s in isolation not with respect to sample total)