Statistics Flashcards

1
Q

What is the difference between inferential and descriptive stats?

A

Inferential stats help us generalize (make inferences) about a wider population from a smaller sample. They are based on the assumption that the population sampled has normal distributed characteristics and the sample is randomly selected (probability sample).
Descriptive stats organizes and summarizes data using numbers and graphs. It uses data summary (bar graphs, pie charts, etc), measures of central tendency (mean, median, mode), and measures of variability (range, variance, and standard deviation)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the difference between statistic versus parameter?

A

Parameter: whole population
Statistics: describing sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are univariate, bivariate, and multivariate analysis?

A

Uni-variate: analysis of one variable (diagrams)
Bi-variate: analysis of two variables, explores relationship between them and searches for correlations
Multi-variate: three or more variables, the relationship might be spurious, or an intervening variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the types of multivariate analysis?

A

Common factor analysis and principle component analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a bell curve, platy, lepto, and skew?

A

Bell curve/normal curve: what usually occurs; the mean, median, and mode end up in the middle at about the same place, scores cluster at the middle
Platykurtic: flat relative to the normal curve
Leptokurtic: tall relative to the normal curve
Skew: where the tail is distribution is, negative skew is when the tail is in the low numbers (down to the left), positive skew is when the tail is in the high numbers (down to the right)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the central limit theorem?

A

The bigger the sample the better. In many situations, it Will eventually lead to a normal curve.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the normal curve? What Percentages are important?

A

68.3% of its values occur plus or minus one SD from the mean
95% of its values occurs plus or minus two SD from the mean
99.7% of its values occur plus or minus three SD from the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is there to note about missing data/anomalous data?

A

If there is data that is completely missing, don’t use it. If there is partial missing data, its the researchers judgement call.
Anomalous data is data that appears “out of line,” discrepancies indicate measurement problems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the mean, median, and mode?

A

Mean: sum all values in distribution then divide by total number of values
Median: Middle point with entire range of values
Mode: Most frequently occurring value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the measurements of dispersion?

A

Standard Deviation: average amount of variation around the mean
Maximum/minimum
Range: highest value minus lowest value
Interquartile range: calculated after highest and lowest 25% are removed
Variance: a measure of the extent to which values in data set vary

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a z score?

A

A z score is the number of units of standard deviation any one value is above or below the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the differences between ANOVA vs T test and Correlation?

A

T Test (compare means)
Correlation (levels of data needed)
ANOVA (analysis of variance), compares three or more groups and compares variance among groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are F, R, and T values?

A

T value: ratio of the difference between the mean of the two sample sets and the variation that exists between them
R value: correlation coefficient that measures the strength of the relationship between two variables
F value: the ratio of two variances

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are P values?

A

Probability level
Calculated by converting your statistic into a z score
Used to determine the likelihood that an observed outcome is the result of chance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the Null hypothesis?

A

Testing offers probability, not certainty. Must decide to accept probability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the chi square?

A

Analyzes the relationship between two or more different categorical variables

16
Q

What is the difference between causation and correlation?

A

Causation: A causes B
Correlation: A relates to B