DECK 12: INFERENCE PART A (1 samp hyp tests and intervals) Flashcards

Question 1

Q

notation: what is mu

Answer

A

true population mean (average)

Question 2

Q

notation: what is p

Answer

A

true population proportion (percent in the population)

Question 3

Q

notation: what is x-bar

Answer

A

mean of your sample

Question 4

Q

notation: what is p-hat

Answer

A

sample proportion (percent in our sample)

Question 5

Q

notation: what is a p-value

Answer

A

At the end of a hypothesis test, it is the likelihood of getting your results if the null was true.

Question 6

Q

notation: what is z*

Answer

A

critical z, how many SE you are reaching up and down in a confidence interval for proportions

Question 7

Q

notation: what is t*

Answer

A

critical t, how many SE you are reaching up and down in a confidence interval for means

Question 8

Q

notation: what is mu - mu

Answer

A

true difference between two populatinon means

Question 9

Q

notation: what is p - p

Answer

A

true difference between two population proportions (percents).

Question 10

Q

notation: what is xbar- xbar

Answer

A

difference between two sample means

Question 11

Q

notation: what is phat - phat

Answer

A

difference between two sample proportions

Question 12

Q

notation: what is Ho

Answer

A

The NULL, the dull, the “things haven’t changed” hypothesis

Question 13

Q

notation: What is Ha

Answer

A

The alternative. This is what you are trying to prove.

Question 14

Q

What is the difference between the distribution of a sample and a sampling distribution?

Answer

A

A distribution of a sample is just a histogram of the DATA in a sample. A sampling distribution is made from an bunch of sample STATISTICS. It is the distribution of the statistic that was calculated from those many many samples.

Question 15

Q

What is a sampLING distribution?

Answer

A

a pile of statistics. A pile of p-hats or x-bars.

Question 16

Q

Are models what really happen?

Answer

A

No. A model train is not a real train. We use models to say what kind of happens.

Question 17

Q

What is “statistically significant?”

Answer

A

When our sample statistic is so far away from what we were expecting that we don’t think that it was due to random sampling error. Then is statistically significant. When p-value is below the alpha, we say “statistically significant”.. Low p-values are statistically significant.

Question 18

Q

What is the differnce between standard error and standard deviation?

Answer

A

Standard error is the typical distance a STATISTIC is from the mean in a sampling distribution (pile of a bunch of sample’s statistics) and Standard Error is the typical distance a DATUM is from the mean in a pile of raw data.

Question 19

Q

What does CLT say about the distribution of the population?

Answer

A

Not much… just that it doesn’t matter what it is.. With large samples.. The SAMPLING dist will be approx normal (dist of stats.. NOT DATA)

Question 20

Q

What are the mean and standard deviation of a sampling distribution for a proportion?

Answer

A

mean is p and sdandard deviation is root pq/n (look at formula sheet) N(p, root (pq/n) )

Question 21

Q

What does Central Limit Theorem Say?

Answer

A

It basically says.. NO MATTER WHAT SHAPE THE POPULATION IS (normal, bimodal, uniform, skewed, crazy.. ) If you make a histogram of a bunch of means taken from a bunch of samples, that histogram will be unimodal and symmetric WITH LARGE ENOUGH SAMPLES.. Close to normal. So.. A nerdy way to say it is: The sampling distribution of means is approximately normal no matter what the population is shaped like. The larger the sample size, the closer to normal. (the normal curve is just a model.. the sampling distribution is close to it, but not it! we use the model anyway!)

Question 22

Q

What is difference between population of interest and parameter of interest?

Answer

A

Population is the WHO (subjects you measure, beads people) Parameter is the actual number you want (like % of or AVG)

Question 23

Q

What happens to a pile of statistics if you take larger samples?

Answer

A

All of the x-bars or all of the p-hats will get closer to eachother, and closer to the parameter ( mu or p)

Question 24

Q

What does the CLT say about the distribution of actual sample data?

Answer

A

Nothing? The sample will be distributed similar to the population. Bimodal populations have bimodal samples. The CLT only talks about distributions (histograms) of sample statistics, of summaries, which are groups of means.., NOT OF INDIVIDUALS!!!! NOT DATA

Question 25

Q

N ( ?1 , ?2 ) what does this mean?

Answer

A

it means NORMAL models centered at ?1 With a standard deviation of ?2

Question 26

Q

Describe the distribution of a sample

Answer

A

It will look like the population. The distribution of a sample is a histogram made from the sample, which will look kind of like the population. If the population is bimodal, then the distribution of the sample is bimodal. The SAMPLING distribution of a bunch of means, however, will look normalish.

Question 27

Q

What is a standard error?

Answer

A

The typical, or expected, error. It is how far off you are expecting your statistic to be from the parameter. It is calculated like the standard deviation, but we are using sample statistics.. We don’t know the true parameters, so we estimate with statistics adding error to our calculation

Question 28

Q

How do statistics from big samples compare to small? (notice this doesn’t ask about DATA)

Answer

A

Larger sample statistics have less variablility, so statistics from larger samples are closer to eachother and to the parameter. Statistics from smaller samples are more spread out, further away from true parameter.

Question 29

Q

What is statistical inference?

Answer

A

Using a statistic to infer something about a parameter.. Basically, using a sample to say something about a population.

Question 30

Q

what is a statistic

Answer

A

some numerical summary of a sample.. Could be the mean of a sample, the standard deviation of a sample, the proportion of successes in a sample, the slope calculated from a sample, a difference of 2 means from 2 samples, a difference of 2 proportions from 2 samples, a difference of 2 slopes from 2 samples.. you can make sampling distributions for any of these, and they will all be centered around the parameter…

Question 31

Q

what is a parameter?

Answer

A

some numerical summary of a population. Often called “the parameter of interest.” It is what we are often trying to find.. It doesn’t vary. It is out there and STUCK at some value, it is the truth, and you’ll probably not ever know it! We try to catch them in our confidence intervals, but sometimes we don’t (and we don’t know it!). It Could be the mean of a population, the standard deviation of a population, the proportion of successes in a population, the slope calculated from a population, a difference of 2 means from 2 population, a difference of 2 proportions from population

Question 32

Q

What is the Fundemental Theorem of Statistics?

Answer

A

The CLT!! The Central Limit Theorem!

Question 33

Q

What is sampling variability?

Answer

A

same as sampling error. The natural variation of sample statistics.. NOT DATA.. Samples vary. so do their statistics.. Parameters do not vary!

Question 34

Q

What is sampling error?

Answer

A

same as sampling variability.. The natural variability between STATISTICS.. NOT DATA!!! . We call it error EVEN THOUGH YOU MADE NO MISTAKES!!!

Question 35

Q

what happens to t models as n gets larger?

Answer

A

The models look more like the normal model. An infinite sample size would give a t model identical to the normal model.

Question 36

Q

What is an unbiased estimator?

Answer

A

When the sampling distribution (pile of sample stats) is centered on the true population parameter.

Question 37

Q

how are t models like Normal models?

Answer

A

both are unimodal and symmetric. T models aren’t as high and have more area in tails, that’s why you have to reach out a little further than z for same confidence.

Question 38

Q

what is a biased estimator?

Answer

A

When the sampling distribution (pile of sample stats) is NOT centered on the true population parameter. If you were weighing people and there was a 1 pound weight on the scale, the pile would be centered 1 pound higher. Baised.

Question 39

Q

What are the mean and standard deviation of a sampling distribution for a mean?

Answer

A

mean is mu and standard deviation is sigma/root n (look at formula sheet) N(mu, sigma/rootn)

Question 40

Q

What if you want more confidence?

Answer

A

get a bigger net.. (wider conficence interval) (or increase sample size)

Question 41

Q

when do you need crits?

Answer

A

in confidence intervals (and old fashioned hyp tests.. We look at Z to see if greater than crit.)

Question 42

Q

how do you find z and t crit?

Answer

A

for z crit.. INVNORM(area in 1 tail) for t crit. INVT(area in 1 tail, deg freedom).

area in 1 tail is just ( 1-CL) / 2

Question 43

Q

What is a margin of error?

Answer

A

critical * s.d. It is how far you reach out in a confidence interval.. You reach up and down one of these, so the interval is actually 2 margins of error wide.

Question 44

Q

How is a confidence interval made?

Answer

A

statistic +- margin of error. Statistic +- (crit * s.d ). Stand at the statistic, reach out up and down a margin of error, and hope that you catch the parameter.

Question 45

Q

What is a critical value?

Answer

A

It is the amount of standard errors you’ll reach out, depending on your confidence (a t or z). Example.. 68% crit z = 1 .. For 95% crit z = 2 (well, 1.96).. For means.. Use t crits

Question 46

Q

what does “95% confidence” in a 95% confidence interval mean? (explain the confidence level)

Answer

A

It means if we took a ton of samples, and made confidence intervals from each of them,ABOUT 95% of the intervals would contain the parameter, 5% would not.

Question 47

Q

Do parameters vary?

Answer

A

NO!!! Statistics do because they are calculated from samples, different samples have different statistics. they vary from sample to sample. The parameter doesn’t vary because there is only one.

Question 48

Q

How wide is a confidence interval? (how many ME?)

Answer

A

It is 2 margins of error wide ALWAYS (DON’T CONFUSE WITH NUMBER OF SE)

Question 49

Q

What is a confidence interval?

Answer

A

it is a parameter catcher.. Like a fishing net. We stand at our statistic, and reach up and down a margin of error, and hope to CATCH the parameter? sometimes we do, sometimes we don’t? but we never know.. Mooo hooo hooo haaaa haaa haaa (evil laugh)

Question 50

Q

Can you make a 100% confidence interval?

Answer

A

Sure, I’m 100% confident that it will snow between 0 and 500 feet tomorrow.

Question 51

Q

Is a confidence interval a PROBABLILITY?

Question 52

Q

What are we confident in?

Answer

A

our confidence lies in our interval. if we took another sample.. We’d have a different interval..

Question 53

Q

Will 95% of other statistics be within my interval?

Answer

A

NO!!! You have no idea where your statistic is (or your interval) in regards to true parameter

Question 54

Q

What if you want more cofidence with same size interval?

Answer

A

increase your sample size

Question 55

Q

What are conficence intervals for?

Answer

A

PARAMETER CATCHERS. They are an attempt to say what the true population parameter is.. It is our best guess. “We think that there will be between 8 and 12 inches of snow”

Question 56

Q

How is a margin of error different from a standard error?

Answer

A

A margin of error is a NUMBER OF STANDARD ERRORS. It is how far up or down you go in a confidence interval. A standard error tells you about the spread of a pile of statistics (a sampling distribution).

Question 57

Q

What is a point estimate?

Answer

A

Your p-hat or your x-bar. Your best guess. What you got in your sample. It is in the middle of the interval.

Question 58

Q

What is a t-crit?

Answer

A

It is the same as z crit. It is the number of sd you reach out in your CI. To find it, do INVT(area in one tail, degrees of freedom)

Question 59

Q

How do you find Margin of Error from an inteval?

Answer

A

It is half the width.. (HI-LO divided by 2) Remember you stand at statistic (point estimate) and reach up and down a Margin of Error. So an inteval is always exactly 2 margins of error wide)

Question 60

Q

How can you tell if it is a T or a Z procedure?

Answer

A

YES-NO-PROP-Z. Remember t for means, z for proportions. Think of the subjects. Could you get the info in a yes/no fashion? if so, then z-props. Do you need to get a number from each subject? if so, then t-means.

Question 61

Q

how do you find deg freedom?

Answer

A

n-1 for one sample, for 2 samples you must use calculator. For PAIRED use n-1, REGRESSION IS n-2

Question 62

Q

How do you find point estimate from an interval?

Answer

A

It is in the dead center of interval, so take the average of the upper and lower bounds.

Question 63

Q

who invented the t model?

Answer

A

Bill Gosset, guiness brewing company.

Question 64

Q

interpret this 90% confidence interval for avg weight of mice (2.3, 3.5) in ounces

Answer

A

I am 90% confident that the mean weight of mice is between 2.3 and 3.5 ounces

Answer 64

A

90% of intervals made this way would catch the true mean weight. If you took 1000 samples and made 1000 intervals, about 900 intervals would catch the true weight, 100 would not.

Answer 65

A

Never accept a Ho, don’t keep the Null. simply “FAIL TO REJECT THE NULL”

Answer 66

A

set alpha= .02 and reject only p-values below that

Answer 67

A

a significance level

Answer 68

A

With such a low p-value, (p-value < alpha) I reject the null hypothesis. There is strong evidence that the proportion of students who eat rice has changed.

Answer 69

A

With a p-value this high (p-value > alpha) I fail to reject the null. There is not enough evidence to say that more students like eggs now.

Answer 70

A

It is the probability of getting your sample randomly if the null were true.
Basically, how likely is it that your sample statistic came from the Null Model.

Answer 71

A

The DULL HYPOTHESIS, the nothing changed hypothesis, the no-difference hypothesis, the “he’s telling the truth” hypothesis, the “No trickery” hypothesis

Answer 72

A

The alternative.

Answer 73

A

if it just says “changed” or “different”.. Then it is 2 sided.. DOUBLE THE P VALUE!If it says “more” “less than” “greater” etc.. Then it is just one sided..

Answer 74

A

With p-value this low (show p value < alpha) I reject the null hypothesis. There is strong evidence that the proportion of students who eat rice has changed.

Answer 75

A

With a p-value this high(show p value < alpha) I fail to reject the null. There is not enough evidence to say that more students like eggs now.

Answer 76

A

you reject when YOU HAVE EVIDENCE

Answer 77

A

you fail to reject when you DON’T HAVE EVIDENCE

Answer 78

A

NO.. We just fail to reject it.

Answer 79

A

use p-null..

Answer 80

A

Make your Ho and Ha 2. Make a Null Model (centered at null, use your Ho as center and in calculations, use your sample size).. This is a sampling distribution for the statistics if the null were true. 3. THINK then CHECK. use your statistic (p-hat, x-bar, phat1-phat2, xbar1-xbar2) to calculate your test statistic and then p value

Answer 81

A

a t or z score (or chi squared) that you use to find a p value

Answer 82

A

It is a sampling distribution. It tells us how sample statistics would vary if the null were true. It is centered at the null. A pile of p-hats or x-bars.

Answer 83

A

xbar is your sample mean, mu is your hypothesized population mean

Answer 84

A

It is the rejection area. Generally, we use .05. The significance level.

Answer 85

A

compare it to alpha. if p value

Answer 86

A

1 or 2 samples? Proportions (z) or Means (t)? Test or Interval?

(YES/NO/PROP/Z)

Answer 87

A

1 proportion Z interval
1 proportion Z test
1 sample mean T interval
1 sample mean T test

Answer 88

A

2 proportion Z interval
2 proportion Z test
2 sample mean T interval
2 sample mean T test