Statistics Concepts Flashcards

1
Q

level of measurement***

A

refers to way numbers assigned to categories of variable (nominal, ordinal, ratio/interval)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

discrete variable***

A

assumes distinct values, can be finite or infinite, always some minimum unit beyond which categories cannot be divided

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

continuous variable***

A

can assume any value on continuum

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

unit of analysis

A

Refers to specific group or entity on which statistical

analysis is performed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

data structure

A

types: cross-sectional, time series, panel

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

array

A

univariate visualization involving list of observations in increasing or decreasing order

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

stem-and-leaf plot

A

univariate visualization; Stem represents leading digit of number, Leaf represents trailing digits

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

frequency distribution***

A

univariate visualization which counts number of times category repeated in data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

percent distribution***

A

univariate visualization which shows what percent of cases fall into each category

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

cumulative distribution***

A

univariate visualization which shows relative position of category in distribution (e.g. percentiles of GRE test takers)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

histogram***

A

univariate visualization which uses vertical bars to designate frequency/percent of category, no spaces between bars to convey that variable measured on continuum

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

pie chart/bar graph

A

Pie charts/bar graphs are univariate visualizations used to graph nominal or ordinal variables, most useful when variable being graphed has few categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

mean***

A

Most commonly used measure of central tendency; mean restricted to interval/ratio variable because
involves arithmetic, also used with dichotomous
variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

trimmed mean

A

reduces influence of extreme values on calculation of mean by excluding portion of data in tails of distribution (e.g., 10 percent trimmed mean discards highest/lowest 10 percent of data)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

median***

A

Measure of central tendency used with ordinal or higher

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

mode***

A

measure of central tendency used for all levels of measurement

17
Q

variance***

A

denoted s^2, most commonly used measure of variability; takes into account all observations of variable

18
Q

standard deviation***

A

measure of variability, denoted S. Disadvantage of variance uses square of deviation from mean measure not expressed in units of original variable, can return to original metric by taking square root of variance

19
Q

range

A

Measure of variability measuring span of data, or

maximum possible difference in categories

20
Q

interquartile range

A

Modified version of range, not as susceptible to outliers; measured as range of middle 50 percent of
observations and equal to difference b/w third quartile (75th
percentile) and first quartile (25th percentile)

21
Q

box plot

A

Graphical device used to display univariate summary measures, displays 5 key measures on axis: min, Q1, median, Q3,
max

22
Q

skewness

A

skewness measures describe whether distribution

symmetrical or skewed. Rightward skew = positive, leftward skew = negative

23
Q

kurtosis

A

Measure of skewness reflecting peakedness or flatness of distribution; Leptokurtic = tall peak, kurtosis value > 3, Platykurtic = flat peak, kurtosis value < 3, Mesokurtic = symmetric peak, kurtosis = 3