Stats M. 2 Flashcards

1
Q

relative frequency distribution

A

listing of distinct values and their relative frequencies (proportions or percentages)

Numerical summary for categorical data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Bar Chart

A

Graphical Summary for categorical data
-bars do not touch each other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Pareto diagram

A

graphical summary for categorical data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

pie chart

A

graphical summary for categorical data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Mean

A

sum of observations divided by the number of observations

numerical summary for quantitative data

sensitive to/affected by extreme values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

median

A

the number that divides the bottom 50% of the data from the top 50% of the data

numerical summary for quantitative data

not sensitive to/not affected by extreme values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

mode

A

any value that occurs with the greatest frequency

numerical summary for quantitative data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

percentiles

A

indicate the point below which a certain percentage of observations fall`

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

quartiles

A

special type of percentile that divides data into quarters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Q1

A

25%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Q2

A

median- 50%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Q3

A

75%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

standard deviation

A

tells us whether the observations within the data set tend to be close to the mean or far away from the mean

numerical summary for quantitative data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

IQR

A

the difference between Q3 and Q1
tells us about the variability of the middle 50%

numerical summary for quantitative data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

range

A

difference between the maximum and minimum value

numerical summary for quantitative data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Dotplot

A

graphical summary for quantitative data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

histogram

A

graphical summary for quantitative data

Bars touch

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

density plot

A

graphical summary for quantitative data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

box plot

A

graphical summary for quantitative data
5 number summary (minimum, Q1, median, Q3, maximum)

20
Q

time plots

A

graphical summary for quantitative data

21
Q

S.O.C.S

A

Shape
Outliers
Center
Spread

22
Q

Shape

A

Unimodal, bimodal, multimodal
skewness or symmetrical
-left skewed= tail goes to negative side
-right skewed=tail goes to positive side

23
Q

Outliers

A

unusual values

24
Q

Center
-symmetric + no outliers

A

Report the mean

25
Q

Center
-skewed +/or outliers

A

report the median

26
Q

Spread
-symmetric + no outliers

A

report the standard deviation

27
Q

Spread
-skewed +/or outliers

A

report the IQR

28
Q

Comparative graphical displays (quantitative + categorical)

A

SOCS
Histogram + box plot

29
Q

Bivariate Data

A

data that contains 2 variables

30
Q

Association
(Bivariate Data)

A

a relationship between two variables

31
Q

Response Variable
(Bivariate Data)

A

measured to make comparisons between groups

32
Q

Explanatory Variable
(Bivariate Data)

A

explains the value of the response variable

33
Q

contingency table

A

a frequency distribution for bivariate data (also called a two-way or cross-tabulation table)

34
Q

conditional proportions
(Bivariate Data)

A

proportions based on the explanatory variable for the categories of the response variable
(divide each cell count by the corresponding row total)

35
Q

No association
(Bivariate Data)

A

values (%) within each column or bar heights of same color are similar

36
Q

Yes association
(Bivariate Data)

A

values (%) within each column or bar heights of same color are different

37
Q

comparative bar chart

A

a chart that compares the conditional proportion of the response variable within each category of the explanatory variable

38
Q

Mosaic plots

A

another comparative chart

39
Q

Scatterplots

A

summarize bivariate quantitative data

40
Q

Positive Association
(bivariate quantitative data)

A

as values of one variable increase, so do values of the other

41
Q

Negative association
(bivariate quantitative data)

A

as values of one variable increase, values of the other variable decrease

42
Q

No association
(bivariate quantitative data)

A

no apparent relationship between the two variables

43
Q

correlation

A

measure of the strength and direction of the linear relationship between two variable

44
Q

weak correlation

A

positive: 0 < r < 0.4
negative: -0.4 < r < 0

45
Q

moderate correlation

A

positive: 0.4 < r < 0.8
negative: -0.8 < r < -0.4

46
Q

strong correlation

A

positive: 0.8 < r < 1
negative: -1 < r < -0.8

47
Q
A