stats 3 Flashcards

1
Q

methods used to summarize data and describe its main features.

A

Descriptive statistics

2
Q

the type of value that a variable takes on

A

Measurement metric

3
Q

Categorical variables

A
  • Nominal variables
  • Ordinal variables
  • Binary variables
4
Q

Numerical variables

A
  • Continuous variables
  • Discrete variables
5
Q

Categorical variables (def)

A

represent categories or groups and do not have a numeric value

6
Q

Nominal variables (def)

A

categorical variables with no inherent order or ranking among the categories

7
Q

Ordinal variables (def)

A

categorical variables that have a meaningful order or ranking, but the intervals between the categories are not necessarily equal

8
Q

Binary variables (def)

A

a special type of categorical variable that has only two possible values (a.k.a. dichotomous or dummy variables)

9
Q

Numerical variables (def)

A

represent quantities and can be measured on a numeric scale.

10
Q

Continuous variables

A

can take any value within a range and can be subdivided into finer increments with equal unit distances

11
Q

Discrete variables

A

can only take specific, distinct values, often counts or integers.

12
Q

Rank statistics

A

a class of statistics used to describe the variation of continuous variables based on their ranking from lowest to highest values

13
Q

Median value

A

the value of the case that sits at the exact center of the cases when we rank the values of a single variable from the smallest to the largest observed value

14
Q

Range

A

the difference between the maximum and minimum values of a variable

15
Q

Quartile

A

one of the cut points that divide a ranked dataset into four equal parts, with each part containing 25% of the data

16
Q

Interquartile range (IQR):

A

the difference between the values of a variable at the 75% and the 25% ranks (third quartile minus first quartile)
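The rank statistics on the preceding cards (median, range, quartiles, IQR) can be sketched with Python's standard library. The sample values are hypothetical, and the "inclusive" quartile method is an assumption: other quartile conventions give slightly different cut points.

```python
import statistics

# Hypothetical sample of a single continuous variable (illustrative values only).
values = [7, 1, 3, 9, 5, 11, 13]

ranked = sorted(values)                   # rank from lowest to highest
median = statistics.median(ranked)        # value at the exact center of the ranking
value_range = max(ranked) - min(ranked)   # maximum minus minimum

# Quartile cut points. method="inclusive" is an assumed convention that
# treats the sample as containing its own minimum and maximum.
q1, q2, q3 = statistics.quantiles(ranked, n=4, method="inclusive")
iqr = q3 - q1                             # spread of the middle 50% of cases

print(median, value_range, q1, q3, iqr)
```

The second quartile `q2` is the median, so the two calls agree on this sample.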

17
Q

The interquartile range is a measure of the ___, or spread, of values

A

dispersion

18
Q

Box-whisker plot

A

a graphical representation of data that displays the median, quartiles, and potential outliers, using a box to show the interquartile range and “whiskers” to indicate the range of the data.
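As a rough sketch, the numbers behind a box-whisker plot can be computed without drawing anything. The card does not say how "potential outliers" are identified; the 1.5 × IQR fences used here are a common convention and an assumption, as are the sample values and the "inclusive" quartile method.

```python
import statistics

# Hypothetical sample (illustrative values only).
values = sorted([1, 2, 2, 3, 3, 3, 4, 4, 5, 9])

q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
iqr = q3 - q1                      # height of the box

# Assumed outlier rule: points beyond 1.5 * IQR from the box edges.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
outliers = [x for x in values if x < lower_fence or x > upper_fence]

# Whiskers reach the most extreme values that are not flagged as outliers.
inside = [x for x in values if lower_fence <= x <= upper_fence]
whisker_low, whisker_high = min(inside), max(inside)

print(q1, median, q3, whisker_low, whisker_high, outliers)
```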

19
Q

___ summarize data based on the order of values, while moments provide numerical measures that describe the shape and distribution of the data.

A

Rank statistics

20
Q

Moments

A

numerical measures derived from the data values themselves and their positions relative to the mean or origin. They provide information about the shape of the data distribution, including measures such as mean (first moment), variance (second moment), skewness (third moment), and kurtosis (fourth moment).

21
Q

Mean (first moment)

A

the average value of a variable

22
Q

The zero-sum property of the mean

A

if you subtract the mean of a dataset from each data point, the sum of these deviations will always be zero.
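A quick numerical check of the zero-sum property, using hypothetical sample values. Floating-point rounding can leave a tiny residue on other data, so the comparison uses a tolerance rather than exact equality.

```python
import math
import statistics

# Hypothetical sample (illustrative values only).
values = [2.0, 4.0, 4.0, 5.0, 10.0]

mean = statistics.fmean(values)
deviations = [x - mean for x in values]   # each data point minus the mean
total = sum(deviations)

# The deviations cancel out, up to floating-point rounding.
print(math.isclose(total, 0.0, abs_tol=1e-9))
```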

23
Q

The ___ tells us that the mean is the value that minimizes the sum of squared differences between itself and each data point

A

least-squares property of the mean
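The least-squares property can be illustrated by comparing the sum of squared deviations around the mean with the same sum around other candidate centers. The sample values and the helper name `sum_squared_deviations` are hypothetical.

```python
import statistics

def sum_squared_deviations(data, center):
    """Sum of squared differences between each data point and `center`."""
    return sum((x - center) ** 2 for x in data)

# Hypothetical sample (illustrative values only).
values = [2.0, 4.0, 4.0, 5.0, 10.0]
mean = statistics.fmean(values)

sse_at_mean = sum_squared_deviations(values, mean)

# Any other center, including the median, gives a larger sum of squares.
candidates = [mean - 1.0, mean - 0.1, mean + 0.1, mean + 1.0,
              statistics.median(values)]
for c in candidates:
    print(c, sum_squared_deviations(values, c) > sse_at_mean)
```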

24
Q

The mean of a variable is often called its ___ because it is the value you would expect the variable to take on average.

A

expected value

25
Q

Variance (second moment)

A

a measure of the dispersion of a variable around its mean, calculated as the average of the squared deviations from the mean.

26
Q

Standard deviation

A

another measure of the dispersion of a variable around its mean: the square root of the variance, expressed in the variable's original units
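A minimal sketch of variance and standard deviation on hypothetical values. The cards do not say whether the population form (divide by n) or the sample form (divide by n − 1) is intended, so both are shown; which one applies is an assumption to check against the course.

```python
import math
import statistics

# Hypothetical sample (illustrative values only).
values = [2.0, 4.0, 4.0, 5.0, 10.0]
mean = statistics.fmean(values)

# Population form: average squared deviation from the mean.
pop_variance = sum((x - mean) ** 2 for x in values) / len(values)
pop_std = math.sqrt(pop_variance)

# Sample form: the standard library divides by n - 1 instead of n.
sample_variance = statistics.variance(values)
sample_std = statistics.stdev(values)

print(pop_variance, pop_std, sample_variance, sample_std)
```

The standard deviation is in the same units as the variable, while the variance is in squared units.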

27
Q

We can use a ___ to depict the dispersion of a variable (its variance and standard deviation)

A

histogram

28
Q
___ in histograms are the intervals or ranges into which data are grouped to show how frequently values fall within each range

A

Bins
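Grouping values into histogram bins can be sketched in a few lines. The function name `histogram_counts`, the equal-width bins, and the sample values are assumptions made for illustration; plotting libraries offer other binning rules.

```python
def histogram_counts(data, num_bins):
    """Count how many values fall into each of `num_bins` equal-width bins."""
    low, high = min(data), max(data)
    width = (high - low) / num_bins
    if width == 0:
        raise ValueError("all values are identical; cannot form bins")
    counts = [0] * num_bins
    for x in data:
        # The maximum value is placed in the last bin instead of overflowing.
        index = min(int((x - low) / width), num_bins - 1)
        counts[index] += 1
    edges = [low + i * width for i in range(num_bins + 1)]
    return edges, counts

# Hypothetical sample (illustrative values only).
values = [1, 2, 2, 3, 3, 3, 4, 4, 5, 9]
edges, counts = histogram_counts(values, num_bins=4)

# Text rendering: one row per bin, one '#' per case.
for left, right, count in zip(edges, edges[1:], counts):
    print(f"[{left:.1f}, {right:.1f}) {'#' * count}")
```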
29
Q
___ smooth out histograms.
A

Kernel density plots

30
Q

Kernel density plot

A

a visual depiction of the distribution of a single variable based on a smoothed calculation of the density of cases across the range of values
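The "smoothed calculation of the density" can be sketched as follows. The Gaussian kernel, the bandwidth of 1.0, the function name `gaussian_kde`, and the sample values are all assumptions; the card does not specify a kernel or a bandwidth.

```python
import math

def gaussian_kde(data, bandwidth):
    """Return a function estimating the density at x with a Gaussian kernel."""
    n = len(data)
    norm = n * bandwidth * math.sqrt(2 * math.pi)

    def density(x):
        # Each case contributes a small bell curve centered on its own value.
        return sum(math.exp(-0.5 * ((x - xi) / bandwidth) ** 2)
                   for xi in data) / norm

    return density

# Hypothetical sample (illustrative values only).
values = [1, 2, 2, 3, 3, 3, 4, 4, 5, 9]
density = gaussian_kde(values, bandwidth=1.0)

# Evaluate on a grid; the area under the estimated curve should be close to 1.
step = 0.1
grid = [i * step for i in range(-100, 200)]
area = sum(density(x) * step for x in grid)

print(round(area, 3), density(3) > density(7))
```

A larger bandwidth gives a smoother curve, and a smaller one follows the individual cases more closely.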

31
Q

Skewness (third moment)

A

a measure of the asymmetry of the variable’s distribution around the mean; a positive value indicates a longer right tail and a negative value a longer left tail
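A minimal sketch of skewness as the standardized third moment, using population (divide by n) moments. This formula is one common definition and is an assumption here, as are the function name `skewness` and the sample values; statistical packages often apply a small-sample correction.

```python
import statistics

def skewness(data):
    """Third central moment divided by the second central moment to the 3/2 power."""
    n = len(data)
    mean = statistics.fmean(data)
    m2 = sum((x - mean) ** 2 for x in data) / n   # second central moment
    m3 = sum((x - mean) ** 3 for x in data) / n   # third central moment
    return m3 / m2 ** 1.5

# Hypothetical samples (illustrative values only).
symmetric = [1, 2, 3, 4, 5]
right_skewed = [1, 2, 2, 3, 3, 3, 4, 4, 5, 9]
left_skewed = [-x for x in right_skewed]          # mirror image

print(skewness(symmetric), skewness(right_skewed), skewness(left_skewed))
```

A symmetric sample gives a value of zero, a long right tail gives a positive value, and its mirror image gives the same magnitude with a negative sign.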
