chapter 3: descriptive statistics, numerical methods, and some predictive analytics Flashcards

1
Q

what is the data set’s central tendency?

A

represents the center or middle of the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is the population mean (μ)?

A

average of the population measurements

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Population parameter

A

number calculated from all the population measurements

describes some aspect of the population

population mean (μ (mew)) is part of this

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Sample statistic

A

number calculated using the sample measurements

describes some aspect of the sample

to find a point estimate of a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what is the sample mean

A

it is X bar

average of the sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is the median

A

The value of the middle point of the ordered measurements

value such that 50% of all measurements lie above (or below) it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what is the mode

A

The most frequent value

most frequently occurrent measurement in data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what happens to the median if the number of measurements is odd

A

the median is the middlemost measurement in the ordering (In increasing order)

the most in the. middle

are the values that are observed “most typically”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what happens to the median if the number of measurements is even

A

the median is the average of the two middlemost measurements in the ordering

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

so what is the difference between the mean and the median

A

the mean will find the average

the median will find the most middle point of the measurements (in increasing order)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what does it mean for data to have two modes

A

the data is bimodal

there is a higher frequency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

if there is skewness, should you use the mean or the median

A

median

If there is skewness, the mean is trash to use

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

are all measures of central tendency necessarily typical values??

A

naaaah boyyy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is a point estimate

A

one number estimate of the value of the population parameter

should not be a blind guess

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is “n” for the formula of the sample mean?

A

number of sample measurements

sample size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

will the sample mean always equal the population mean??

A

aaaah boy

unless you are extremely lucky

17
Q

what happens when more than two modes exist in data?

A

the data is multimodal

18
Q

what is a modal class?

A

only happens if data is presented in classes

it is the class having the highest frequency

19
Q

when is a mean or median used more than a mode?

A

when we want to describe a data set’s central tendency by using a single number

20
Q

why would we use the relative frequency curve on a histogram?

A

to smooth out the shape of the sample population

21
Q

describe the mean, median, and mode in a symmetrical relative frequency curve

A

the mean, median, and mode are all equal

22
Q

describe the mean, median, and mode in a skewed to the right relative frequency curve

A

mean > median > mode

mode is located under the highest point of the frequency curve

mean is larger cause it also average larger values

median is resistant to extreme values but mean is not

23
Q

describe the mean, median, and mode in a skewed to the left relative frequency curve

A

mean < median < mode

mode is located under the highest point of the frequency curve

mean is smaller cause it also average smaller values

24
Q

what is the range?

A

Largest minus the smallest
measure

Measures the interval spanned by all the data

25
Q

what is the variance

A

average of the squared deviations of all the population measurements from the population mean

26
Q

what is the standard deviation

A

The square root of the population variance

Tells you how much deviation there is between the population variance

27
Q

what are the there absolute measures

A

standard deviation

variance

range

28
Q

what is the empirical rule?

A
  1. 26% of the population measurements
  2. 44%
  3. 73%
29
Q

when do we use the emperical rule?

A

When we have a symmetrical or bell shaped distribution, this shit works really well

If a population has mean µ and standard deviation σ and is described by a normal curve

30
Q

when is chebysev’s theorem practical?

A

With not much info, but do have mean of population (mew) and variance (sigma (sugma dicc))

Don’t use if its skewed af

31
Q

what is the chebysev’s theorem formula

A

100(1-1/k2)%