DECK 14: ALL INFERENCE MIXED Flashcards

Question

what is a parameter?

Answer 1

some numerical summary of a population. Often called "the parameter of interest." It is what we are often trying to find.. It doesn't vary. It is out there and STUCK at some value, it is the truth, and you'll probably not ever know it! We try to catch them in our confidence intervals, but sometimes we don't (and we don't know it!). It Could be the mean of a population, the standard deviation of a population, the proportion of successes in a population, the slope calculated from a population, a difference of 2 means from 2 population, a difference of 2 proportions from population

Answer 2

YES-NO-PROP-Z. Remember t for means, z for proportions. Think of the subjects. Could you get the info in a yes/no fashion? if so, then z-props. Do you need to get a number from each subject? if so, then t-means.

Answer 3

Sure, I'm 100% confident that it will snow between 0 and 500 feet tomorrow.

Answer 4

It is half the width.. (HI-LO divided by 2) Remember you stand at statistic (point estimate) and reach up and down a Margin of Error. So an inteval is always exactly 2 margins of error wide)

Answer 5

It means if we took a ton of samples, and made confidence intervals from each of them,ABOUT 95% of the intervals would contain the parameter, 5% would not.

Answer 6

n= (z^2 * p * q )/ (ME ^2) and n = ( t*s / ME) ^ 2 (start with Z then do T)

Answer 7

Nothing? The sample will be distributed similar to the population. Bimodal populations have bimodal samples. The CLT only talks about distributions (histograms) of sample statistics, of summaries, which are groups of means.., NOT OF INDIVIDUALS!!!! NOT DATA

Answer 8

The alternative.

Answer 9

All of the x-bars or all of the p-hats will get closer to eachother, and closer to the parameter ( mu or p)

Answer 10

95% confidence interval (there is .025 in each tail)

Answer 11

Bill Gosset, guiness brewing company.

Answer 12

n-1 for one sample, for 2 samples you must use calculator. For PAIRED use n-1, REGRESSION IS n-2

Answer 13

It is probability that you'll make a Type II error.. P(Type II error)

Answer 14

When our sample statistic is so far away from what we were expecting that we don't think that it was due to random sampling error. Then is statistically significant. When p-value is below the alpha, we say "statistically significant".. Low p-values are statistically significant.

Answer 15

both are unimodal and symmetric. T models aren't as high and have more area in tails, that’s why you have to reach out a little further than z for same confidence.

Answer 16

1. Sample is random2. Sample is small enough (<10%)3. Sample is large enough (np&nq>10 for props, n>30 for means or the histogram is normalish)EXTRAS: chi squared exp at least 5 in each cell, regression- random resid

Answer 17

A pile of means from a bunch of samples.

Answer 18

true population mean (average)

Answer 19

critical z, how many SE you are reaching up and down in a confidence interval for proportions

Answer 20

The distribution fits [the expected distribution]

Answer 21

It is in the dead center of interval, so take the average of the upper and lower bounds.

Answer 22

USE CALCULATOR.(or smaller sample-1). you have to run an interval or a test on your TI and read the output (unless you want to use the equation.)

Answer 23

increase your sample size

Answer 24

BE ABLE TO SKETCH THE ALPHA BETA POWER DIAGRAM from the original pregnancy worksheet. Know where everything is. This helps you understand how alpha, beta and power interact.

Answer 25

The likelihood you correctly reject a false null.. The likelihood you correctly detect what you were trying to detect

Answer 26

It is a sampling distribution. It tells us how sample statistics would vary if the null were true. It is centered at the null. A pile of p-hats or x-bars.

Answer 27

critical t, how many SE you are reaching up and down in a confidence interval for means

Answer 28

It will look like the population. The distribution of a sample is a histogram made from the sample, which will look kind of like the population. If the population is bimodal, then the distribution of the sample is bimodal. The SAMPLING distribution of a bunch of means, however, will look normalish.

Answer 29

When the sampling distribution (pile of sample stats) is centered on the true population parameter.

Answer 30

Your p-hat or your x-bar. Your best guess. What you got in your sample. It is in the middle of the interval.

Answer 31

1 or 2 samples? Proportions (z) or Means (t)? Test or Interval?(YES/NO/PROP/Z)

Answer 32

With p-value this low (show p value < alpha) I reject the null hypothesis. There is strong evidence that the proportion of students who eat rice has changed.

Answer 33

It is the same as z crit. It is the number of sd you reach out in your CI. To find it, do INVT(area in one tail, degrees of freedom)

Answer 34

they go up and down together

Answer 35

our confidence lies in our interval. if we took another sample.. We'd have a different interval..

Answer 36

xbar is your sample mean, mu is your hypothesized population mean

Answer 37

get a bigger net.. (wider conficence interval) (or increase sample size)

Answer 38

NO!!! You have no idea where your statistic is (or your interval) in regards to true parameter

Answer 39

mean is mu and standard deviation is sigma/root n (look at formula sheet) N(mu, sigma/rootn)

Answer 40

2 proportion Z interval2 proportion Z test2 sample mean T interval2 sample mean T test

Answer 41

Type 1: You think the person has mathphobia, but they don'tType 2: They have mathphobia, but you didn't notice

Answer 42

PARAMETER CATCHERS. They are an attempt to say what the true population parameter is.. It is our best guess. "We think that there will be between 8 and 12 inches of snow"

Answer 43

A distribution of a sample is just a histogram of the DATA in a sample. A sampling distribution is made from an bunch of sample STATISTICS. It is the distribution of the statistic that was calculated from those many many samples.

Answer 44

it means NORMAL models centered at ?1 With a standard deviation of ?2

Answer 45

1. Make your Ho and Ha 2. Make a Null Model (centered at null, use your Ho as center and in calculations, use your sample size).. This is a sampling distribution for the statistics if the null were true. 3. THINK then CHECK. use your statistic (p-hat, x-bar, phat1-phat2, xbar1-xbar2) to calculate your test statistic and then p value

Answer 46

sample proportion (percent in our sample)

Answer 47

as one increases, the other decreases, and vice versa. They have to because they BOTH ADD TO ONE!!! Power + Beta = 1

Answer 48

a significance level

Answer 49

Yes.. increase samle size. They move together with constant sample size.

Answer 50

It basically says.. NO MATTER WHAT SHAPE THE POPULATION IS (normal, bimodal, uniform, skewed, crazy.. ) If you make a histogram of a bunch of means taken from a bunch of samples, that histogram will be unimodal and symmetric WITH LARGE ENOUGH SAMPLES.. Close to normal. So.. A nerdy way to say it is: The sampling distribution of means is approximately normal no matter what the population is shaped like. The larger the sample size, the closer to normal. (the normal curve is just a model.. the sampling distribution is close to it, but not it! we use the model anyway!)

Answer 51

regression t test, chi-squared test for independence.

Answer 52

The alternative. This is what you are trying to prove.

Answer 53

a pile of differences of TWO PROPORTIONS, taken from a bunch of PAIRS of samples. Take two samples, calculate proportions, subtract to get a difference, PUT THE DIFFERENCE IN THE PILE.

Answer 54

T ratio is just SLOPE/ST ERROR and the p value is just TCDF(T ratio, 9999, n-2)

Answer 55

A sampling distritubion. If you took a bunch of samples and calculated a bunch of Chi-square statistics, the pile of chi squareds would look like that.

Answer 56

1 proportion Z interval1 proportion Z test1 sample mean T interval1 sample mean T test

Answer 57

The mean is the degrees of freedom and the mode is df-2. The cutoff is at 1.5df+3.

Answer 58

It is the rejection threshold. You reject p-values below it.. It is how willing you are to make a Type 1 error? alpa=P(Type I error)

Answer 59

At the end of a hypothesis test, it is the likelihood of getting your results if the null was true.

Answer 60

some numerical summary of a sample.. Could be the mean of a sample, the standard deviation of a sample, the proportion of successes in a sample, the slope calculated from a sample, a difference of 2 means from 2 samples, a difference of 2 proportions from 2 samples, a difference of 2 slopes from 2 samples.. you can make sampling distributions for any of these, and they will all be centered around the parameter...

Answer 61

90% of intervals made this way would catch the true mean weight. If you took 1000 samples and made 1000 intervals, about 900 intervals would catch the true weight, 100 would not.

Answer 62

NO!!! Statistics do because they are calculated from samples, different samples have different statistics. they vary from sample to sample. The parameter doesn't vary because there is only one.

Answer 63

Test for independence.

Answer 64

use p-null..

Answer 65

homogeneity is more than one sample and asking about one variable, independence is just one sample with two variables.

Answer 66

same as sampling variability.. The natural variability between STATISTICS.. NOT DATA!!! . We call it error EVEN THOUGH YOU MADE NO MISTAKES!!!

Answer 67

one sample t: n-1two sample t: calc or smaller n-1regression: n-2Chi square GOF: cells-1CHi square hom/indep: ROW-1 x COL-1

Answer 68

true difference between two populatinon means

Answer 69

A pile of slopes taken from a bunch of samples

Answer 70

STAT +/- CRIT SE of slope

Answer 71

mean is p and sdandard deviation is root pq/n (look at formula sheet) N(p, root (pq/n) )

Answer 72

Increase alpha or increase sample size..

Answer 73

The typical, or expected, error. It is how far off you are expecting your statistic to be from the parameter. It is calculated like the standard deviation, but we are using sample statistics.. We don't know the true parameters, so we estimate with statistics adding error to our calculation

Answer 74

Type 1: you think it worked, but it didn't so you spend 4 million on a program that isn't good.Type 2: It worked, but you didn't notice, so you miss the opportunity to adopt a good math program.

Answer 75

categorical, quantitative.

Answer 76

true difference between two population proportions (percents).

Answer 77

With such a low p-value, (p-value < alpha) I reject the null hypothesis. There is strong evidence that the proportion of students who eat rice has changed.

Answer 78

for z crit.. INVNORM(area in 1 tail) for t crit. INVT(area in 1 tail, deg freedom). area in 1 tail is just ( 1-CL) / 2

Answer 79

The [two variables in context] are independent.

Answer 80

If n>30, good to go. If n<30, then you have to make sure the histogram of the sample looks normalish.

Answer 81

In a two sample T test you are comparing TWO SAMPLE AVERAGES to eachother. In a PAIRED T test you are looking just at JUST ONE average of the THIRD LIST… They are paired.. So you find each individual BEFORE-AFTER and take the average of all of those differences. You do ONE SAMPLE T TEST on it because you really have one mean. You just the average or the difference list.

Answer 82

It is the probability of getting your sample randomly if the null were true. Basically, how likely is it that your sample statistic came from the Null Model.

Answer 83

a pile of chi-squared statistics calculated from a bunch of samples

Answer 84

for GOF: Exp %(total).. For indep and homog: ROW*COL/TOTAL

Answer 85

NO.. We just fail to reject it.

Answer 86

Random residuals (equal random scatter, no pattern).

Answer 87

increase sample size. this will also increase power.

Answer 88

I am 90% confident that the mean weight of mice is between 2.3 and 3.5 ounces

Answer 89

np>10 and nq>10. actually Show this calculation

Answer 90

Type 1: you think it increased sales but it didn'gType 2: It actually increased sales but you didn't notice

Answer 91

1. You have to check the conditions for each sample AND2. The samples have to be independent from eachother

Answer 92

When you have ONE ROW or ONE COLUMN… then it gives you a ratio , like 1:2:5 or it gives you expected percents.

Answer 93

STAT +/- CRIT SE

Answer 94

difference between two sample means

Answer 95

(rows-1)(columns - 1) (remove a column and row.. Count boxes)

Answer 96

A pile of proportions (%) from a bunch of samples.

Answer 97

With a p-value this high (p-value > alpha) I fail to reject the null. There is not enough evidence to say that more students like eggs now.

Answer 98

a t or z score (or chi squared) that you use to find a p value

Answer 99

A pile of average differences. Remember that in a paired test, you are getting an individual differenc from each pair of data, then finding the average of differences.

Answer 100

Make sure that there are at least 5 in each expected cell.

Answer 101

if it just says "changed" or "different".. Then it is 2 sided.. DOUBLE THE P VALUE!If it says "more" "less than" "greater" etc.. Then it is just one sided..

Answer 102

true population proportion (percent in the population)

Answer 103

Never accept a Ho, don't keep the Null. simply "FAIL TO REJECT THE NULL"

Answer 104

It is 2 margins of error wide ALWAYS (DON'T CONFUSE WITH NUMBER OF SE)

Answer 105

The models look more like the normal model. An infinite sample size would give a t model identical to the normal model.

Answer 106

Goodness of fit test.

Answer 107

set alpha= .02 and reject only p-values below that

Answer 108

compare it to alpha. if p value

Answer 109

90% confidence interval tests a one tailed test. There is 5% in the tail.

Answer 110

difference between two sample proportions

Answer 111

Test for homogeneity.

Answer 112

The CLT!! The Central Limit Theorem!

Answer 113

No. A model train is not a real train. We use models to say what kind of happens.

Answer 114

Make sure the histogram of the residuals is normalish.

Answer 115

Larger sample statistics have less variablility, so statistics from larger samples are closer to eachother and to the parameter. Statistics from smaller samples are more spread out, further away from true parameter.

Answer 116

same as sampling error. The natural variation of sample statistics.. NOT DATA.. Samples vary. so do their statistics.. Parameters do not vary!

Answer 117

statistic +- margin of error. Statistic +- (crit * s.d ). Stand at the statistic, reach out up and down a margin of error, and hope that you catch the parameter.

Answer 118

critical * s.d. It is how far you reach out in a confidence interval.. You reach up and down one of these, so the interval is actually 2 margins of error wide.

Answer 119

xbar diff=0 (the average diff is zero)

Answer 120

Using a statistic to infer something about a parameter.. Basically, using a sample to say something about a population.

Answer 121

1. Randomly chosen sample (or assigned treatment). Circle the word random or explain why you think it is. 2. Sample size is less than 10% of the population. Show that 10n is less than N. Example, for 50 students, write 10(50)=500 is less than all students.3. Nearly normal (or large enough sample)- this differs based on the type of data and test.

Answer 122

p1=p2 OR... p1-p2=0, there is no diff

Answer 123

The NULL, the dull, the "things haven't changed" hypothesis

Answer 124

With a p-value this high(show p value < alpha) I fail to reject the null. There is not enough evidence to say that more students like eggs now.

DECK 14: ALL INFERENCE MIXED Flashcards

(151 cards)