Statistics Flashcards

1
Q

A symmetric distribution with equal mean, median, and mode
Standard deviations follow the 68 - 95 - 99.7 rule

A

Normal distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

A normal distribution with a mean of 0 and a standard deviation of 1

A

Standard distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Asymmetric distribution. Named either left-skewed or right-skewed by what side the tail is on

For right skew distributions, mean > median.
For left skew distributions, median > mean.

A

Skewed distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

IQR = (third quartile) - (first quartile)

A

Interquartile range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Line inside the box represents the median

Upper and lower quartiles (75%, 25% of data rely beyond, respectively) indicated by upper and lower edges of the box

Maximum and minimum datapoints represented by the whiskers
95% confidence interval indicated by the notches. If notches between boxes do not overlap → statistical significance indicated

A

Box-and-whisker plot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

The factor that is being changed or manipulated

A

Independent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

The outcome that is being measured

A

Dependent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Variable that influences both the independent and dependent variables. Confounds the ability to determine causality
Example: Tobacco smoking confounds the relationship between chewing tobacco and mortality

A

Confounding variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Variable that influences the strength of a relationship between two other variables
Example: Amount smoked influences the relationship between cigarette smoking and mortality

A

Moderating variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Variable that explains the relationship between two other variables

Example: Increased cancer risk helps explain relationship between smoking and mortality

A

Mediating variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

The r value, describing the linear relationship between two variables
Ranges from -1 to 1 and describes the direction and strength of an association
Example: -0.30 is a negative and moderate association, whereas +0.90 is a positive and strong association

A

Correlation coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Study variables that are not directly measurable (e.g. depression, happiness) are defined in a way that they can be measured

A

Variable operationalization

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

p value definition

A

Describes the likelihood of finding a difference when, in reality, there is no difference between the groups (null hypothesis is true)
Statistical significance threshold for the study is most often set at α = 0.05 (5%)
This means if p < 0.05, then we reject the null hypothesis; if p > 0.05 then we fail to reject the null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Requires categorical variables
Compares null hypothesis vs alternative hypothesis, looking to see if the two distributions of categorical data differ

A

Chi-square

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Requires continuous variables
Compares the mean values of continuous variables of 2 groups

A

T-test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Requires continuous variables
Similar to t-test, but can be used for 3 or more groups

A

ANOVA

17
Q

A threshold that describes the chance that results of a study are due to random chance rather than causal effect, usually set at α = 0.05

A

Statistical significance

18
Q

When the null hypothesis is incorrectly rejected, in other words a false positive

A

type 1 error

19
Q

When the null hypothesis is incorrectly supported, in other words a false negative

A

type 2 error

20
Q

States that no significant difference or relationship exists between study variables

A

Null hypothesis

21
Q

factors that can cause a result of a study to differ from the true result

A

bias

22
Q

Bias introduced by the selection process of including subjects in a study (e.g. study population is not representative of the whole population)

A

selection bias

23
Q

Study subjects behave differently when they know they’re being studied

A

Hawthorne effect

24
Q

Bias in survey studies where people answer in a way considered socially desirable and acceptable

A

Social desirability bias

25
Q

Administration of an inactive substance or sham procedure corresponds to improved symptoms

A

Placebo effect

Often related to a person’s belief that a treatment will work

Comparison between placebo group and treatment group is used to determine the true benefit of a given treatment (where the patients are unaware if they received the treatment or placebo - known as blinding)

26
Q

Certain individuals in a population have a greater chance of being selected for a study than other individuals, resulting in a sample that does not accurately reflect the population

A

Sampling bias

27
Q

Tendency of a person to answer questions on a survey untruthfully or misleadingly

A

Response bias