Lecture 11 9/18/24 Flashcards

1
Q

How are data sets typically arranged?

A

-variables are in columns
-experimental units are in rows

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the characteristics of nominal data?

A

-categories should be exhaustive
-categories should be mutually exclusive
-no comparative relationship is implied

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the characteristics of ordinal data?

A

-categories should be all inclusive and mutually exclusive
-categories should have rank order with qualitative differences in relative amounts between the categories
-distances between categories are not assumed to be equal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the characteristics of continuous data?

A

-values should be all inclusive and mutually exclusive
-differences between values is uniform across the entire scale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the two types of continuous scale data?

A

-interval, measured in whole integers
-ratio, measured in decimals or fractions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why is it important to distinguish between categorical and continuous variables?

A

-determines the method of presentation in graphs/tables
-determines the choice of statistical tests for significance
-different statistics are often used for nominal vs. ordinal variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the characteristics of expressing continuous data as categorical?

A

-can always be done, but with a loss of information
-loss of information often leads to less statistical power
-continuous variables should only be categorized if there is good reason

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

When is it best to use mean vs. median to represent central tendency?

A

-mean works well in large populations with normal distribution
-median works well when the distribution is skewed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Which measures of central tendency are resistant to extreme values, and which are not?

A

resistant: median, mode
non-resistant: mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why is it important to NOT calculate a mean with number-labelled categories?

A

because they are categories, the difference between “numbers” does not need to be equal; therefore the mean does not actually represent the average of the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is measure of dispersion?

A

extent to which a set of scores deviate from some measure of central tendency for that set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is range?

A

the difference between the largest and smallest values in the distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What do percentiles and quartiles measure?

A

the proportion of all observations that fall between specified values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is kurtosis?

A

when the values in a data set skew either to the average of the data set or to the extremes of the data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is skewness?

A

when the values in a data set skew to one side of the data set (minimum or maximum)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How does a normal distribution differ from a non-normal distribution?

A

-normal distribution is accurately described by the mean and standard deviation
-non-normal distribution does not fit as well with the mean and standard distribution

17
Q

Why is it useful to divide a distribution into percentile segments?

A

it can be used to compare two distributions for equality

18
Q

What is an independent variable?

A

-causal, predictor, exposure, or explanatory variable
-change in the variable influences an outcome

19
Q

What is a dependent variable?

A

-outcome variable
-change in the variable results from independent variable change

20
Q

What type of statistical test is used on continuous data with a normal distribution and 2 groups?

A

T test

21
Q

What is the null hypothesis for a T test?

A

H0: means are equal

22
Q

What type of statistical test is used on continuous data with a normal distribution and more than 2 groups?

A

ANOVA

23
Q

What is the null hypothesis for an ANOVA?

A

H0: means are equal

24
Q

What types of statistical test are used on discrete data with 2 non-paired groups?

A

-chi-square test if there are more than 5 data points in 75% of cells or more
-Fisher’s exact test if there are more than 5 data points in less than 75% of cells

25
Q

What type of statistical test is used on discrete data with more than 2 groups?

A

chi-square test

26
Q

What is the null hypothesis for tests on discrete data?

A

H0: proportions are equal