stats Flashcards by providence smith

Continous variable

reflects a infinite number of potential values such as the average rainfall in a region

How well did you know this?

Not at all

Perfectly

Discrete variable

countable # of distinct values (heads or tails)

How well did you know this?

Not at all

Perfectly

to determine probability distribution

x values must be between 0 and 1 and sum of all must equal 1

How well did you know this?

Not at all

Perfectly

population

entire group

How well did you know this?

Not at all

Perfectly

sample

specific group you collect data from

How well did you know this?

Not at all

Perfectly

statistics

number describing a sample

How well did you know this?

Not at all

Perfectly

parameter

number describing the whole population

How well did you know this?

Not at all

Perfectly

accuracy

the mothod measures what it intended, the statistic correctly estimates the population parameter

How well did you know this?

Not at all

Perfectly

precise

if the method is repeated, the estimates are very consistent every statistic is nearly the same

How well did you know this?

Not at all

Perfectly

sampling methods that create bias

convience sampling
voluntary sampling

How well did you know this?

Not at all

Perfectly

preferred method

simple random sampling

How well did you know this?

Not at all

Perfectly

what are the properties of the sampling distribution

Sampling distribution’s mean (μ¯X) = Population mean (μ) Sampling distribution’s standard deviation (Standard error) = σ√n,
shape
central tendency
variabiliy

How well did you know this?

Not at all

Perfectly

example of measurement bias (leading question)

do you believe that obama’s horrible beliefs deserve another term in order to ruin our lives.

How well did you know this?

Not at all

Perfectly

example of measurement bias (confusing question)

do you not disagree with the not recent slight changes to the american culture?

How well did you know this?

Not at all

Perfectly

example of a nonresponse bias

do you currently have an std?

How well did you know this?

Not at all

Perfectly

example of voluntary response bias

an internet poll asks its visitors if they prefer cats or dogs

How well did you know this?

Not at all

Perfectly

example of a sample bias (nonrandom sample)

someone asks their twitter followers how they feel about the recent changes to congress

How well did you know this?

Not at all

Perfectly

how do you measure precision

by using standard error

as population size increases, do accuracy and precision change?

no, both are unaffected

as sample size increases, do accuracy and precision change?

accuracy in unaffected, and it becomes more precise.

what does it mean to say that p-hat is a random variable

repeated sampling will result in different p-hat values.

Suppose a statistician is interested in determining the percentage of Americans who prefer Burger King to McDonald’s. She surveys 100 randomly chosen Americans and finds that of those surveyed, 37% prefer Burger King.
Identify…
a. the population
b. the sample
c. the parameter
d. the statistic/estimator of the study

a. americans.
b. 100 americans.
c. proportion of americans who are burger king to the number of burger king fans.
d. 37%

An analyst wants to know if there is a connection between time spent watching TV per day in hours and fat intake per day in grams. He performs a regression using time spend watching TV as the independent variable and fat intake as the dependent variable and finds that r = 0.5 and the regression line is given by: y = 45.8 + 10.3x
a). Explain what the correlation and regression line mean in the context of the data.
b). predict the fat intake of someone who watches 3 hours of tv a day
c). predict y when x=-2
d). which prediction is more reasonable?

a. The correlation means there is a moderate positive connection between time spent watching TV and fat intake.
The regression line means that for each additional hour of TV someone watches, we predict their fat intake will increase by 10.3 grams(slope), and the predicted fat intake of someone who watches no TV is 45.8(intercept)
b. 76.7
c. 25.2
d. b is more reasonable because you can’t watch a negative number of hours of tv in a day

what are the 4 requirements of the central limit theorem

Random and independent sample, population at least 10x the sample size, np ≥ 10, n(1p) ≥ 10; if you don’t know p, use p-hat

A pollster is trying to determine whether Candidate X will win an upcoming election(assume Candidate X needs 51% of the vote to win.) The pollster takes a random sample and determines that phat is .49 and the 90% percent confidence interval is given by (.45, .53). a. what is the margin of error? b. can the pollster be confident that candidate x will lose?

a. .04 or 4% b. no, the confidence interval is below 50% so we can't be sure.

For each situation, state the appropriate null and alternative hypothesis... a. ohio claims that 23% of high school seniors are enrolled in at least 1 ap class. the principal wants to know if the proportion of seniors enrolled in ap class is higher. b. the cleveland metro police claim 8% of cleveland residents were the victims of a robbery or attempted robbery last year. a statistition believe that this number is too high.

a) H0: p = 0.23 vs Ha: p > 0.23 b) H0: p = 0.08 vs Ha: p < 0.08

What should be done to create a confidence interval for a population proportion?

Add and subtract the margin of error to/from the sample proportion

Which of the following does the confidence level measure?

The success rate of the method of finding confidence intervals

Which of the following conditions regarding sample size must be met to apply the Central Limit Theorem for Samples?

The sample size is large enough that the sample expects 10 successes and 10 failures

When taking samples from a population and computing the proportions of each sample, which of the following is always the same?

The population proportion

What is the standard deviation of the sampling distribution called?

Standard error

A researcher has designed a survey in which the questions asked do not produce a true answer. What is this an example of?

Measurement Bias

In a confidence interval, what does the margin of error provide?

How far the estimate is from the population value

Which of the following statements are false? I) The precision of an estimator does not depend on the size of the population II) The precision of an estimator does not depend on the size of the sample III) Surveys based on larger sample sizes have larger standard errors

Both II and III are false

When applying the central limit therom for sample proportions, which of the following can be substituted for p when calculating the standard error if the value of p is unknown?

The value of the sample proportion

If the conditions of a survey sample satisfy those required by the Central Limit Theorem, then there is a 95% probability that a sample proportion will fall within how many standard errors of the population proportion?

2 standard errors

The null hypothesis is always a statement about what?

population parameter

In hypothesis testing, the null hypothesis is best described by which of the following statements?

The null hypothesis always gets the benefit of the doubt and is assumed to be true throughout the hypothesis testing procedure

What is true in a hypothesis test, the farther the test statistic is from 0?

The more null hypothesis is discredited

In hypothesis testing, what does an extreme value for the test statistic indicate?

The null hypothesis is not true

In hypothesis testing, when should the null hypothesis be rejected?

When the p-value is less than the significance value

what do we assume about CLT

random smapling

clt conditions to create a valid confidence interval

1. sample must be random and independent 2. normally distrubuted or equal to 30% 3. not more than 10% of population