Lecture 3: Starting Points in Data Analysis Flashcards

1
Q

What are the quartiles on a boxplot?

A

first quartile (Q1): divides the lowest 25% of the data form the highest 75% (25th percentile or lower quartile)

second quartile (Q2): divides the data in half. 50th percentile or median

third quartile (Q3): divides the highest 25% of the data form the lowest 75%. 75th percentile or upper quartile

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the interquartile range (IQR)?

A

Q3-Q1
sometimes referred to as a middle 50%
measure of variability
median and IQR often reported for variables that are not normally distributed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are outliers?

A

extreme observations in your variable of interest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How does SPSS identify outliers?

A

according to Tukey’s fences method

  • values below Q1-(1.5IQR) or above Q3+(1.5IQR) –> these are marked with an O
  • values below Q1-(3IQR) or above Q3+(3IQR) –> these are marked with an * (more extreme values)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Quantile-Quantile Plots (Q-Q Plots)

A

determines if a variable comes form a specified distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Stem and leaf plot

A

displays the frequency at which certain classes of values appear in the data
can be used to examine distribution of data as well as extreme values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the statistical tests for normality?

A

Shapiro-Wilk test

KS test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the Shapiro-Wilk test?

A

tests null hypothesis that data came from a normally distributed population
more accurate when the sample sizes are <2000

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is Kolmogorov-Smirnov test?

A

Goodness of fit test –> tests null hypothesis that a sample comes from a specified distribution
More accurate when sample sizes are large (n ≥ 2000)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does it mean when you reject the null hypothesis in the test for normality?

A

It means that you fail the normality test and have a significant p value (<0.05) so you dont have data that came from a normally distributed population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does it mean when you accept the null hypothesis or fail to reject it in the test for normality?

A

It means that you pass the normality test and have a non-significant p-value (>0.05) so you have data that comes from a normally distributed population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are some other parameters to assess normality?

A

Skewness and Kurtosis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is skewness?

A

measure of asymmetry

- normal distribution has skewness of 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is kurtosis?

A

measure of tail density relative to a normal distribution

- normal distribution has kurtosis of 3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

On SPSS, what is the ideal measurement of skewness and kurtosis for a normally distributed data?

A

skewness <1

kurtosis between 0-3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Platykurtic

A

kurtosis value <3

no tails

17
Q

Mesokurtic

A

average scale density that you would expect to see in a normal distribution

18
Q

Laptokurtic

A

kurtosis value >3
heavy tailed distribution
smaller peak

19
Q

Skewness <0

A

negatively skewed

to the left

20
Q

Skewness >0

A

positively skewed

to the right

21
Q

Skewness = 0

A

normally distributed

22
Q

When do we do a data transformation?

A

when data is not normally distributed, common to apply a transformation to attempt to improve the normality

23
Q

How to transform a right skewed data? (in order of severity)

A
  • Reciprocal transformation: t = 1/x
  • Log transformation: t = log10(x)
  • Square root transformation: t = sqrt(x)
24
Q

How to transform a left skewed data? (in order of severity)

A

Cubic transformation: t = x (cubed)

Square transformation: t = x (sqrd)

25
Q

When can’t we use reciprocal and log transformation?

A

on 0 value data (add a small constant to every value to fix it)

26
Q

How do we energy adjust macronutrients?

A

express intake as proportion of total energy intake (% calories from total fat)

27
Q

How do we energy adjust micronutrients?

A

intake per 1000kcal

28
Q

How do we energy adjust food groups?

A

intake per 1000kcal

29
Q

What is hypothesis testing?

A

method of determining if results from your study are meaningful
how likely is it that the results arose by chance

30
Q

What is hypothesis?

A

educated guess about your variables of interest

31
Q

What is null hypothesis Ho vs alternate hypothesis H1?

A

Null: there is NO statistically significant difference between the population parameter and the sample statistic being compared

Alternate: statistically significant difference exists between the population parameter and the sample statistic being compared

32
Q

What is a Type 1 error (a)?

A

the rejection of a true null hypothesis (also known as a “false positive” finding)

33
Q

What is a Type 2 error (B)?

A

the failure to reject a false null hypothesis (also known as a “false negative” finding)

34
Q

How is rejection of a false null hypothesis represented?

A

1-B (power of the test) related to the sample size

35
Q

How is acceptance of a true hypothesis represented?

A

1-a (confidence interval)

36
Q

What is the p value?

A

probability value

  • based off the assumption that H0 is true
  • gives the probability that results arose simply by chance.
37
Q

What is alpha?

A

the significance level: probability of rejecting the null hypothesis when the null is true
1% p<0.01 (2.58)
5% p<0.05 (1.96)
10% p<0.10 (1.65)

38
Q

One-tailed test vs Two-tailed test

A

one tailed test

is testing the possibility of a relationship in one direction only (comparing means) (u>u0 or u