Quanti - Descriptive Statistis Flashcards

1
Q

What is statistics?

A

“the practice or science of collecting and analysing numerical data in
large quantities, especially to make inferences on a population based on
a representative sample.”

To helps us turn data into information that can be interpreted, understood and used to improve evidence-based healthcare

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the 2 broad classifications of statistics?

A

Descriptive and inferential statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are descriptive statistics?

A

to provide description of population through numbers, graphs and tables

(COLLECT, SUMMARIZE, DESCRIBE).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are inferential statistics?

A

to provide meaningful inferences/conclusions on
the population based on data collected from a sample

(INTERPRET, GENERALIZE, PREDICT).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How are continuous variables usually presented?

A
  1. Measure of central tendency
  2. Measure of dispersion/variability
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How are categorical variables usually presented?

A

Measure of frequency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What kinds of statistical methods are used for inferential statistics?

A
  1. Hypothesis testing
  2. Regression analysis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What do the following terms mean?

  1. Population
  2. Parameter
  3. Variable
  4. Sample
  5. Statistic
A

Population: Collection of entire set of individual objects or events of interest

Parameter: Numerical characteristic of population

Variable: Characteristic that is being measured.

Sample: Subset of a population

Statistic: Measure that describes the sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Descriptive statistics for categorical variables - Measure of frequency

A

How are they presented?

  1. Frequency (%):
    e.g. Males 70 (70%); females 30 (30%)
  2. Cross-tabulation
    (present frequency in a table form)
  3. Use of pictogram
  4. Use of pie, bar, column, line, scatter, all sorts of charts
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Descriptive statistics for continuous variables - Measure of central tendency

A

Usually, mean, median and mode are used.

Can be illustrated via histogram (barchart stuck together) or boxplot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is mean?

A
  • arithmetic average of a set of values
  • more suitable for symmetric
    distribution
  • OFTEN REPORTED WITH STANDARD DEVIATION
    e.g. mean (SD)

Formula:
Add all values divided by total number of values (frequency)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is median?

A
  • middle value of a data set when arranged in ascending or descending order
  • more suitable for SKEWED distribution
  • OFTEN REPORTED WITH INTERQUARTILE RANGE
    e.g. median (IQR))

Formula:
If n (frequency)= 9,
Median: ((9+1))/2 th term: 5th term

If n (frequency)= 10,
Median: (5th term + 6th term)/2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is mode?

A

value that occurs most frequently in a data set

Example:
Data set {2, 3, 4, 4, 5, 6, 6, 6, 7}.
Mode = 6

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a normal distribution?

A

Mean, median and mode coincides at one point on the x-axis. (see slide 12)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a left skewed distribution?

A

Mean is on the left side of x-axis. (see slide 15)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a right skewed distribution?

A

Mean is on the right side of the x-axis. (see slide 15)

17
Q

What is the normality of distribution?

A
  • All kinds of naturally occurring variables are usually normally distributed (Bell curve)
    e.g. height, weight

Central limit theorem: If sample size > 30, should follow a normal distribution

  • Main property: mean, median and mode are the same (symmetrical curve)
18
Q

What is skewness in a normal distribution?

A

lack of symmetry

  • tells us direction of variability
  • different from variance, which tells magnitude of variability
19
Q

What is variance?

A
  • average of squared differences of each datapoint
    from mean (squared unit of mean)
  • if sigma, σ2=0: all data values are the same
  • Small variance= data are close to mean and each other
  • Large variance= data are far from mean and each other
20
Q

What is standard deviation?

A

square root of variance (same unit as mean)

21
Q

What is the 68-95-99.7 rule?

A

a statistical principle that applies to bell-shaped, normal distributions

tells you where most of your values lie in a normal distribution:

~68% of values are within 1 SD from the mean

~95% of values are within 2 SD from the mean

~99.7% of values are within 3 SD from the mean