2 (1) Statistics Flashcards

1
Q

What do we use statistics for?

A

Describing data, applying normative data to clinical practice, looking for associations, seeing whether variables are similar or different and if this is down to chance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Pros and cons of using statistics

A

Concise - can filter down info into numbers
Generalisation of findings to wider population

Numbers remove context and meaning
Still need qualitative data, as numbers can’t convey subtle differences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Types of variables

A

Got category/scale
And within scale (ordinal versus interval/ratio)

  • discrete or nominal or categorical
  • ordinal variables
  • Continuous or scale variables
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Discrete/nominal/categorical

A

Classify data into categories (e.g. gender)

Example: Y or N.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Ordinal variables

A

Order matters but not the actual differences between the numbers

Examples self rating scales

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

continuous/scale variables

A

Values are along the scale. There is order; differences in magnitude.

Example: age, income, grades

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How would you describe scale variables?

A

Data distribution: Normal and skewed distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

In what case could we put the distribution of the data in histogram

A

If the date is continuous and we have enough data points.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is normal distribution?

A

After plotting frequencies on a histogram, we can get a symmetrical bell-like curve. This is known as normal distribution. The largest portion cluster in the middle.

The relevant values are mean and standard to you soon.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What’s skewed distribution?

A

Distribution (on histogram) is not symmetrical.

The relevant values are the median and range.
Skewness value must be above +1 and below -1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Positive versus negative skew.

A

Positive skew: most people score in the lower range. Mean>median.

Negative skew: most people score in the high range.
Median>mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Bimodal distribution

A

Two or more central clusters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the measures of central tendency?

A
  • mean
  • median
  • mode
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Mean

A

Average score obtained by adding all the scores and dividing the number of cases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Median

A

Results are put in numerical order and the middle value is found. It is less affected by extreme scores.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Mode

A

Most freq occurring number

17
Q

How can mean median and mode be affected?

A

More normal the distribution, the closer the three measure of central tendency are.

Mean is sensitive to the range of data and outliers.

In normal distribution: mean is good descriptor of data.
Skewed distribution: median better descriptor

18
Q

Range and outlier

A

Range: highest and lowest score; minimum and max

Outlier: value in data that is markedly different to the others

19
Q

Aside from using mean, median, mode, what’s another way of looking at the spread of data?

A

Calculate the difference between each score and mean.

Variance: the average of the squared differences from the mean

Standard deviation: measure of how spread out the date is = the square root of variance.

20
Q

What’s standard deviation and how is it reported?

A

It’s a measure of how spread out the data is.

Always reported with the mean so MEAN(SD)
When SDs overlap, we are less confident in the results.

21
Q

SDs and percentages

A

NORMAL, 0-1 SD➡️ 68% (34% above and below the mean)
+/-1 to 2 SD ➡️ 14%
2-3 SD ➡️ 2%

22
Q

What’s interquartile range?

A

It reports only 50% of the data range. Located in the middle of distribution.

4 quartiles. Median is in the middle of the IQR.

23
Q

What’s standard error?

A

It’s a measure of how accurate an estimate of the population mean our sample mean is.

  • 95% confidence ➡️ 95% pop. fall in range (5% standard error)

The smaller the SE, the better the sample mean is an estimation of the population mean.

24
Q

How would we describe/report categorical data?

A

Reporting the frequency of cases of each category; reporting the percentage frequency of cases in each category.

Plot discrete data on pie or bar charts.