gyu other questions you messed up chapter 1 - 3 Flashcards

1
Q

stratified random sampling

A

when important groups within a population are separately sampled

those samples are combined to form a full sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is a nominative variable?

A

a qualitative variable with no meaningful ordering

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is undercoverage?

A

occurs when some population elements are excluded from the process of selecting the sample

It results in being unbalanced

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is nonresponse

A

occurs whenever some of the individuals who were supposed to be included don’t want to come or simply can’t

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what is the science of using a sample of measurements to make generalizations about the important aspects of a population of measurements?

A

statistical inference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

If we examine all of the population measurements, we say that we are conducting a:

A

census of the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what is a strata

A

In order to select a stratified random sample, we divide the population into nonoverlap-ping groups of similar elements (people, objects, etc.)

these groups are strata

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is systematic sampling

A

an alternative to random sampling

In order to systematically select a sample of n elements without replacement from a frame of N elements, we divide N by n and round the result down to the nearest whole number

Calling the rounded result ℓ, we then randomly select one element from the first ℓ elements in the frame—this is the first element in the systematic sample

The remaining elements in the sample are obtained by selecting every ℓth element following the first (randomly selected) element

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what are the flaws of multiple choice survey questions?

A

may fail to discover the reasons underlying a respondent’s choice of a particular answer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what are the flaws of dichotomous survey questions?

A

possible responses are quite limited

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what are the flaws of open-ended survey questions?

A

can be difficult to evaluate and summarize

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is a ratio variable

A

a quantitative variable measured on a scale such that ratios of its values are meaningful and there is an inherently defined zero value

variables such as salary, height, weight, time, and distance are ratio variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is an interval variable

A

quantitative variable where ratios of its values are not meaningful and there is not an inherently defined zero value

Temperature (on the Fahrenheit scale) is an interval variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what does a frequency distribution do

A

summarizes the number of items in each number of several non overlapping classes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what does a frequency polygon do

A

useful for comparing several distributions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what is the cumulative relative frequency

A

when all the relative frequencies equal to 1.00

17
Q

how do you construct a frequency polygon

A

you plot points above each class’ midpoint at a height equal to the relative frequency of a class

18
Q

how do you calculate the number of classes in a frequency distribution

A

2^k > n

k is the number of classes

19
Q

what type of chart can:

identify quality problems

suggest where a process needs improvement

help prioritize problem solving activities

A

a Pareto chart

20
Q

when would you use the Chebysev theorem?

A

when the curve is not mound shaped (for ex: two peaks)+

also not much skewness

the population has mean and standard deviation

21
Q

for the chebysev theorem, why would you not use a mound shape for it

A

because you can use the Empirical rule

22
Q

what does the Chebysev theorem tell you?

A

how many standard deviations the x is from the mean and shit

same as empirical rule, but with more fucked up shid

23
Q

what does the Chebysev theorem tell you?

A

how many standard deviations the x is from the mean and shit

how much of the population lies between a certain amount if standard deviations

same as empirical rule, but with more fucked up shid

24
Q

why would it not be okay to use any formula using standard deviation for a population with a lot of skewdness

A

the standard deviation will be inflated as shied

25
what should you use instead of a formula using a standard deviation for a population skewed af
percentiles is gyuer
26
what is the calculation of the coefficient of variation
(standard deviation / mean) * 100
27
when can you use the empirical rule
on mount shaped distributions normal distributions (mount shaped be included)
28
what is a obtained from a sample that is used to estimate a similar value from a larger population?
a point estimate
29
what is the covariance
strength of the linear relationship between x and y
30
when do we use the sample covariance?
when we wish to measure the strength of the linear association between two numerical values
31
what is grouped data
the only data available are in the form of a frequency distribution or a histogram it does not give the individual measurements in a data set
32
when approximating the variance for grouped data, what is used to approximate the measurements in a class
the midpoint of each class
33
when do you use a weighted mean
when it makes sense to give different measurements unequal weights
34
how do you use the weighted mean
multiplying each measurement by its weight summing the resulting products and dividing the resulting sum by the sum of the weights
35
what is the best measure of central tendency when outliers are present?
the mode
36
if the measurements of a data set (or curve) is more spread out, how is the variance
bigger
37
if the measurements of a data set (or curve) is more spread out, how is the standard deviation
bigger
38
the raw deviations from the mean sum to what?
0
39
how do you calculate outliers
outside lower limit: Q1 - 1.5*IQR outside upper limit: Q3 + 1.5*IQR