gyu other questions you messed up chapter 1 - 3 Flashcards

1
Q

stratified random sampling

A

when important groups within a population are separately sampled

those samples are combined to form a full sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is a nominative variable?

A

a qualitative variable with no meaningful ordering

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is undercoverage?

A

occurs when some population elements are excluded from the process of selecting the sample

It results in being unbalanced

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is nonresponse

A

occurs whenever some of the individuals who were supposed to be included don’t want to come or simply can’t

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what is the science of using a sample of measurements to make generalizations about the important aspects of a population of measurements?

A

statistical inference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

If we examine all of the population measurements, we say that we are conducting a:

A

census of the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what is a strata

A

In order to select a stratified random sample, we divide the population into nonoverlap-ping groups of similar elements (people, objects, etc.)

these groups are strata

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is systematic sampling

A

an alternative to random sampling

In order to systematically select a sample of n elements without replacement from a frame of N elements, we divide N by n and round the result down to the nearest whole number

Calling the rounded result ℓ, we then randomly select one element from the first ℓ elements in the frame—this is the first element in the systematic sample

The remaining elements in the sample are obtained by selecting every ℓth element following the first (randomly selected) element

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what are the flaws of multiple choice survey questions?

A

may fail to discover the reasons underlying a respondent’s choice of a particular answer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what are the flaws of dichotomous survey questions?

A

possible responses are quite limited

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what are the flaws of open-ended survey questions?

A

can be difficult to evaluate and summarize

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is a ratio variable

A

a quantitative variable measured on a scale such that ratios of its values are meaningful and there is an inherently defined zero value

variables such as salary, height, weight, time, and distance are ratio variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is an interval variable

A

quantitative variable where ratios of its values are not meaningful and there is not an inherently defined zero value

Temperature (on the Fahrenheit scale) is an interval variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what does a frequency distribution do

A

summarizes the number of items in each number of several non overlapping classes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what does a frequency polygon do

A

useful for comparing several distributions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what is the cumulative relative frequency

A

when all the relative frequencies equal to 1.00

17
Q

how do you construct a frequency polygon

A

you plot points above each class’ midpoint at a height equal to the relative frequency of a class

18
Q

how do you calculate the number of classes in a frequency distribution

A

2^k > n

k is the number of classes

19
Q

what type of chart can:

identify quality problems

suggest where a process needs improvement

help prioritize problem solving activities

A

a Pareto chart

20
Q

when would you use the Chebysev theorem?

A

when the curve is not mound shaped (for ex: two peaks)+

also not much skewness

the population has mean and standard deviation

21
Q

for the chebysev theorem, why would you not use a mound shape for it

A

because you can use the Empirical rule

22
Q

what does the Chebysev theorem tell you?

A

how many standard deviations the x is from the mean and shit

same as empirical rule, but with more fucked up shid

23
Q

what does the Chebysev theorem tell you?

A

how many standard deviations the x is from the mean and shit

how much of the population lies between a certain amount if standard deviations

same as empirical rule, but with more fucked up shid

24
Q

why would it not be okay to use any formula using standard deviation for a population with a lot of skewdness

A

the standard deviation will be inflated as shied

25
Q

what should you use instead of a formula using a standard deviation for a population skewed af

A

percentiles is gyuer

26
Q

what is the calculation of the coefficient of variation

A

(standard deviation / mean) * 100

27
Q

when can you use the empirical rule

A

on mount shaped distributions

normal distributions (mount shaped be included)

28
Q

what is a obtained from a sample that is used to estimate a similar value from a larger population?

A

a point estimate

29
Q

what is the covariance

A

strength of the linear relationship between x and y

30
Q

when do we use the sample covariance?

A

when we wish to measure the strength of the linear association between two numerical values

31
Q

what is grouped data

A

the only data available are in the form of a frequency distribution or a histogram

it does not give the individual measurements in a data set

32
Q

when approximating the variance for grouped data, what is used to approximate the measurements in a class

A

the midpoint of each class

33
Q

when do you use a weighted mean

A

when it makes sense to give different measurements unequal weights

34
Q

how do you use the weighted mean

A

multiplying each measurement by its weight

summing the resulting products

and dividing the resulting sum by the sum of the weights

35
Q

what is the best measure of central tendency when outliers are present?

A

the mode

36
Q

if the measurements of a data set (or curve) is more spread out, how is the variance

A

bigger

37
Q

if the measurements of a data set (or curve) is more spread out, how is the standard deviation

A

bigger

38
Q

the raw deviations from the mean sum to what?

A

0

39
Q

how do you calculate outliers

A

outside lower limit: Q1 - 1.5*IQR

outside upper limit: Q3 + 1.5*IQR