DECK 14: ALL INFERENCE MIXED Flashcards

1
Q

what is a biased estimator?

A

When the sampling distribution (pile of sample stats, x bars or p hats) is NOT centered on the true population parameter. If you were weighing people and there was a 1 pound weight on the scale, the pile would be centered 1 pound higher. Baised.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the three chi-squared models?

A

goodness of fit, test for homogeneity, test for independence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a critical value?

A

It is the amount of standard errors you’ll reach out, depending on your confidence (a t or z). Example.. 68% crit z = 1 .. For 95% crit z = 2 (well, 1.96).. For means.. Use t crits

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

you reject when _____________ evidence

A

you reject when YOU HAVE EVIDENCE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

In order to reject a null hypothesis, you need ___________

A

evidence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the missed opportunity error? (the “I didn’t notice” error)

A

Type 2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is alpha?

A

It is the rejection area. Generally, we use .05. The significance level.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a sampLING distribution?

A

a pile of statistics. A pile of p-hats or x-bars.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the null model (the sampling distribution) in a 2 sample mean t-test?

A

a pile of differences of TWO MEANS samples, taken from a bunch of PAIRS of samples. Take two samples, calculate two means, subtract to get a difference, PUT THE DIFFERENCE IN THE PILE.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the NULL HYPOTHESIS?

A

The DULL HYPOTHESIS, the nothing changed hypothesis, the no-difference hypothesis, the “he’s telling the truth” hypothesis, the “No trickery” hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How is a margin of error different from a standard error?

A

A margin of error is a NUMBER OF STANDARD ERRORS. It is how far up or down you go in a confidence interval. A standard error tells you about the spread of a pile of statistics (a sampling distribution).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the differnce between standard error and standard deviation?

A

Standard error is the typical distance a STATISTIC is from the mean in a sampling distribution (pile of a bunch of sample’s statistics) and Standard Error is the typical distance a DATUM is from the mean in a pile of raw data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

For both a chi squared test for independence and a regression t test you are looking for an association, how do the null hypotheses differ?

A

The chi squared will be in words.. Ho: the variables are not associated. The regression t test will be with symbols (and words). Ho: Beta = 0 (the slope is zero) . Saying “beta=0” is the same as saying “there is no association”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

when do you need crits?

A

in confidence intervals (and old fashioned hyp tests.. We look at Z to see if greater than crit.)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

you fail to reject when ____________ evidence

A

you fail to reject when you DON’T HAVE EVIDENCE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

notation: what is x-bar

A

mean of your sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the null for a chi squared test for homogeneity?

A

The [samples of —] are similarly distributed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

How is a paired T test different from a 2 sample mean T test?

A

A paired test talks about an AVERAGE OF DIFFERENCES from one list, whereas a 2 sample mean t-test talks about a DIFFERENCE OF AVERAGES between two samples.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

when is data “paired”

A

when you have 2 measurements of the same variable on the same subject (or matched subjects)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is the null for a 2 sample mean T?

A

mu1=mu2 OR mu1-mu2=0 there is no diff

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

When you are doing PAIRED or MATCHED or BLOCKED tests.. What are you finding?

A

The average difference.. You are doing 1 sample procedures on a NEW THIRD LIST OF DIFFERENCES

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is a confidence interval?

A

it is a parameter catcher.. Like a fishing net. We stand at our statistic, and reach up and down a margin of error, and hope to CATCH the parameter? sometimes we do, sometimes we don’t? but we never know.. Mooo hooo hooo haaaa haaa haaa (evil laugh)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What does CLT say about the distribution of the population?

A

Not much… just that it doesn’t matter what it is.. With large samples.. The SAMPLING dist will be approx normal (dist of stats.. NOT DATA)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is difference between population of interest and parameter of interest?

A

Population is the WHO (subjects you measure, beads people) Parameter is the actual number you want (like % of or AVG)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

what is a parameter?

A

some numerical summary of a population. Often called “the parameter of interest.” It is what we are often trying to find.. It doesn’t vary. It is out there and STUCK at some value, it is the truth, and you’ll probably not ever know it! We try to catch them in our confidence intervals, but sometimes we don’t (and we don’t know it!). It Could be the mean of a population, the standard deviation of a population, the proportion of successes in a population, the slope calculated from a population, a difference of 2 means from 2 population, a difference of 2 proportions from population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

How can you tell if it is a T or a Z procedure?

A

YES-NO-PROP-Z. Remember t for means, z for proportions. Think of the subjects. Could you get the info in a yes/no fashion? if so, then z-props. Do you need to get a number from each subject? if so, then t-means.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Can you make a 100% confidence interval?

A

Sure, I’m 100% confident that it will snow between 0 and 500 feet tomorrow.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

How do you find Margin of Error from an inteval?

A

It is half the width.. (HI-LO divided by 2) Remember you stand at statistic (point estimate) and reach up and down a Margin of Error. So an inteval is always exactly 2 margins of error wide)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

what does “95% confidence” in a 95% confidence interval mean? (explain the confidence level)

A

It means if we took a ton of samples, and made confidence intervals from each of them,ABOUT 95% of the intervals would contain the parameter, 5% would not.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

sample size calcs FOR PROP AND MEANS

A

n= (z^2 * p * q )/ (ME ^2) and n = ( t*s / ME) ^ 2 (start with Z then do T)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What does the CLT say about the distribution of actual sample data?

A

Nothing? The sample will be distributed similar to the population. Bimodal populations have bimodal samples. The CLT only talks about distributions (histograms) of sample statistics, of summaries, which are groups of means.., NOT OF INDIVIDUALS!!!! NOT DATA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

Which hypothesis shows what you are trying to prove?

A

The alternative.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What happens to a pile of statistics if you take larger samples?

A

All of the x-bars or all of the p-hats will get closer to eachother, and closer to the parameter ( mu or p)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

If you are doing a 2 tailed test with alpha=.05.. What confidence interval goes with that?

A

95% confidence interval (there is .025 in each tail)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

who invented the t model?

A

Bill Gosset, guiness brewing company.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

how do you find deg freedom?

A

n-1 for one sample, for 2 samples you must use calculator. For PAIRED use n-1, REGRESSION IS n-2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

What is beta?

A

It is probability that you’ll make a Type II error.. P(Type II error)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What is “statistically significant?”

A

When our sample statistic is so far away from what we were expecting that we don’t think that it was due to random sampling error. Then is statistically significant. When p-value is below the alpha, we say “statistically significant”.. Low p-values are statistically significant.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

how are t models like Normal models?

A

both are unimodal and symmetric. T models aren’t as high and have more area in tails, that’s why you have to reach out a little further than z for same confidence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

What is a way to think about the three conditions?

A
  1. Sample is random2. Sample is small enough (<10%)3. Sample is large enough (np&nq>10 for props, n>30 for means or the histogram is normalish)EXTRAS: chi squared exp at least 5 in each cell, regression- random resid
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

What is the null model(the sampling distribution) in a 1-sample mean T test?

A

A pile of means from a bunch of samples.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

notation: what is mu

A

true population mean (average)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

notation: what is z*

A

critical z, how many SE you are reaching up and down in a confidence interval for proportions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

What is the null for a chi squared GOF test?

A

The distribution fits [the expected distribution]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

How do you find point estimate from an interval?

A

It is in the dead center of interval, so take the average of the upper and lower bounds.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

How do you find df in 2 samples?

A

USE CALCULATOR.(or smaller sample-1). you have to run an interval or a test on your TI and read the output (unless you want to use the equation.)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

What if you want more cofidence with same size interval?

A

increase your sample size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

Can you draw the alpha/beta/power diagram?

A

BE ABLE TO SKETCH THE ALPHA BETA POWER DIAGRAM from the original pregnancy worksheet. Know where everything is. This helps you understand how alpha, beta and power interact.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

How else can you explain power?

A

The likelihood you correctly reject a false null.. The likelihood you correctly detect what you were trying to detect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

What is a null model?

A

It is a sampling distribution. It tells us how sample statistics would vary if the null were true. It is centered at the null. A pile of p-hats or x-bars.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
51
Q

notation: what is t*

A

critical t, how many SE you are reaching up and down in a confidence interval for means

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
52
Q

Describe the distribution of a sample

A

It will look like the population. The distribution of a sample is a histogram made from the sample, which will look kind of like the population. If the population is bimodal, then the distribution of the sample is bimodal. The SAMPLING distribution of a bunch of means, however, will look normalish.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
53
Q

What is an unbiased estimator?

A

When the sampling distribution (pile of sample stats) is centered on the true population parameter.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
54
Q

What is the “you think it worked but it didn’t” error?

A

Type 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
55
Q

What is a point estimate?

A

Your p-hat or your x-bar. Your best guess. What you got in your sample. It is in the middle of the interval.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
56
Q

how can you decide the right test? What are the 3 questions?

A

1 or 2 samples? Proportions (z) or Means (t)? Test or Interval?(YES/NO/PROP/Z)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
57
Q

If the p-value is low, (below alpha), how do you write conclusion?

A

With p-value this low (show p value < alpha) I reject the null hypothesis. There is strong evidence that the proportion of students who eat rice has changed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
58
Q

What is a t-crit?

A

It is the same as z crit. It is the number of sd you reach out in your CI. To find it, do INVT(area in one tail, degrees of freedom)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
59
Q

How are power and alpha related?

A

they go up and down together

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
60
Q

What are we confident in?

A

our confidence lies in our interval. if we took another sample.. We’d have a different interval..

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
61
Q

What are the xbar and mu in t-test? (on calc)

A

xbar is your sample mean, mu is your hypothesized population mean

62
Q

What if you want more confidence?

A

get a bigger net.. (wider conficence interval) (or increase sample size)

63
Q

Will 95% of other statistics be within my interval?

A

NO!!! You have no idea where your statistic is (or your interval) in regards to true parameter

64
Q

What are the mean and standard deviation of a sampling distribution for a mean?

A

mean is mu and standard deviation is sigma/root n (look at formula sheet) N(mu, sigma/rootn)

65
Q

What are the names of four two-sample procedures we do?

A

2 proportion Z interval2 proportion Z test2 sample mean T interval2 sample mean T test

66
Q

If there is a screening test for mathphobia, describe a type 1 and a type 2 error

A

Type 1: You think the person has mathphobia, but they don’tType 2: They have mathphobia, but you didn’t notice

67
Q

What are conficence intervals for?

A

PARAMETER CATCHERS. They are an attempt to say what the true population parameter is.. It is our best guess. “We think that there will be between 8 and 12 inches of snow”

68
Q

What is the difference between the distribution of a sample and a sampling distribution?

A

A distribution of a sample is just a histogram of the DATA in a sample. A sampling distribution is made from an bunch of sample STATISTICS. It is the distribution of the statistic that was calculated from those many many samples.

69
Q

N ( ?1 , ?2 ) what does this mean?

A

it means NORMAL models centered at ?1 With a standard deviation of ?2

70
Q

What are the 3 steps in hypothesis testing AFTER YOU CHECK CONDITIONS?

A
  1. Make your Ho and Ha 2. Make a Null Model (centered at null, use your Ho as center and in calculations, use your sample size).. This is a sampling distribution for the statistics if the null were true. 3. THINK then CHECK. use your statistic (p-hat, x-bar, phat1-phat2, xbar1-xbar2) to calculate your test statistic and then p value
71
Q

notation: what is p-hat

A

sample proportion (percent in our sample)

72
Q

how are beta and power related

A

as one increases, the other decreases, and vice versa. They have to because they BOTH ADD TO ONE!!! Power + Beta = 1

73
Q

what is another name for alpha level?

A

a significance level

74
Q

Can you decrease alpha while increasing power (even though they move together?)..

A

Yes.. increase samle size. They move together with constant sample size.

75
Q

What does Central Limit Theorem Say?

A

It basically says.. NO MATTER WHAT SHAPE THE POPULATION IS (normal, bimodal, uniform, skewed, crazy.. ) If you make a histogram of a bunch of means taken from a bunch of samples, that histogram will be unimodal and symmetric WITH LARGE ENOUGH SAMPLES.. Close to normal. So.. A nerdy way to say it is: The sampling distribution of means is approximately normal no matter what the population is shaped like. The larger the sample size, the closer to normal. (the normal curve is just a model.. the sampling distribution is close to it, but not it! we use the model anyway!)

76
Q

If you take two bits of information from each subject there are two possible tests you could do. If the data is quantitative, you can do___ and if it is categorical, you can do a ______

A

regression t test, chi-squared test for independence.

77
Q

notation: What is Ha

A

The alternative. This is what you are trying to prove.

78
Q

What is the null model (the sampling distribution) in a 2-proporiton Z test?

A

a pile of differences of TWO PROPORTIONS, taken from a bunch of PAIRS of samples. Take two samples, calculate proportions, subtract to get a difference, PUT THE DIFFERENCE IN THE PILE.

79
Q

With regression computer output, how is the t-ratio and the p-value calculated?

A

T ratio is just SLOPE/ST ERROR and the p value is just TCDF(T ratio, 9999, n-2)

80
Q

What is a Chi squared model?

A

A sampling distritubion. If you took a bunch of samples and calculated a bunch of Chi-square statistics, the pile of chi squareds would look like that.

81
Q

what are the names of four one-sample procedures we do?

A

1 proportion Z interval1 proportion Z test1 sample mean T interval1 sample mean T test

82
Q

What is the mean and mode of a chi squared model? What is the 5% cutoff chi-squared?

A

The mean is the degrees of freedom and the mode is df-2. The cutoff is at 1.5df+3.

83
Q

What is alpha?

A

It is the rejection threshold. You reject p-values below it.. It is how willing you are to make a Type 1 error? alpa=P(Type I error)

84
Q

notation: what is a p-value

A

At the end of a hypothesis test, it is the likelihood of getting your results if the null was true.

85
Q

what is a statistic

A

some numerical summary of a sample.. Could be the mean of a sample, the standard deviation of a sample, the proportion of successes in a sample, the slope calculated from a sample, a difference of 2 means from 2 samples, a difference of 2 proportions from 2 samples, a difference of 2 slopes from 2 samples.. you can make sampling distributions for any of these, and they will all be centered around the parameter…

86
Q

what does “ 90% confidence” mean in a 90% interval for avg weight of mice (2.3, 3.5) in ounces

A

90% of intervals made this way would catch the true mean weight. If you took 1000 samples and made 1000 intervals, about 900 intervals would catch the true weight, 100 would not.

87
Q

Do parameters vary?

A

NO!!! Statistics do because they are calculated from samples, different samples have different statistics. they vary from sample to sample. The parameter doesn’t vary because there is only one.

88
Q

If you have one group and ask them two questions, what type of chi squared test is it?

A

Test for independence.

89
Q

Do you use p-hat or p-null when you calculate your standard error in a null model?

A

use p-null..

90
Q

What is diff between homogeneity and test for independence?

A

homogeneity is more than one sample and asking about one variable, independence is just one sample with two variables.

91
Q

What is sampling error?

A

same as sampling variability.. The natural variability between STATISTICS.. NOT DATA!!! . We call it error EVEN THOUGH YOU MADE NO MISTAKES!!!

92
Q

what is df for goodness of fit?

A

cells - 1

93
Q

DEGREES OF FREEEDOM: one sample t, two sample T, regression, chi square gof, chi square hom/indep

A

one sample t: n-1two sample t: calc or smaller n-1regression: n-2Chi square GOF: cells-1CHi square hom/indep: ROW-1 x COL-1

94
Q

notation: what is mu - mu

A

true difference between two populatinon means

95
Q

What is the null model (the sampling distribution) in a regression t test for slope?

A

A pile of slopes taken from a bunch of samples

96
Q

How do you make confidence interval for slope?

A

STAT +/- CRIT SE of slope

97
Q

What are the mean and standard deviation of a sampling distribution for a proportion?

A

mean is p and sdandard deviation is root pq/n (look at formula sheet) N(p, root (pq/n) )

98
Q

How can you increase power?

A

Increase alpha or increase sample size..

99
Q

What is a standard error?

A

The typical, or expected, error. It is how far off you are expecting your statistic to be from the parameter. It is calculated like the standard deviation, but we are using sample statistics.. We don’t know the true parameters, so we estimate with statistics adding error to our calculation

100
Q

If you are testing to see if a math program works in your town (that will cost 4 million to implement) describe a type 1 and type 2 error.

A

Type 1: you think it worked, but it didn’t so you spend 4 million on a program that isn’t good.Type 2: It worked, but you didn’t notice, so you miss the opportunity to adopt a good math program.

101
Q

If you take two bits of information from each subject there are two possible tests you could do. If the a chi squared test would be for _____ data and a regression t-test would be for ______ data

A

categorical, quantitative.

102
Q

notation: what is p - p

A

true difference between two population proportions (percents).

103
Q

How do you write conclusion if you reject?

A

With such a low p-value, (p-value < alpha) I reject the null hypothesis. There is strong evidence that the proportion of students who eat rice has changed.

104
Q

how do you find z and t crit?

A

for z crit.. INVNORM(area in 1 tail) for t crit. INVT(area in 1 tail, deg freedom). area in 1 tail is just ( 1-CL) / 2

105
Q

WHat is the null for a chi squared test for independence”

A

The [two variables in context] are independent.

106
Q

What is the “nearly normal” condition for means? (aka large enough sample)

A

If n>30, good to go. If n<30, then you have to make sure the histogram of the sample looks normalish.

107
Q

What is difference between 2 Samp T test and a PAIRED T Test

A

In a two sample T test you are comparing TWO SAMPLE AVERAGES to eachother. In a PAIRED T test you are looking just at JUST ONE average of the THIRD LIST… They are paired.. So you find each individual BEFORE-AFTER and take the average of all of those differences. You do ONE SAMPLE T TEST on it because you really have one mean. You just the average or the difference list.

108
Q

What is a p-value

A

It is the probability of getting your sample randomly if the null were true. Basically, how likely is it that your sample statistic came from the Null Model.

109
Q

What is the null model (the sampling distribution) in a chi squared test?

A

a pile of chi-squared statistics calculated from a bunch of samples

110
Q

How do you find Expected Count?

A

for GOF: Exp %(total).. For indep and homog: ROW*COL/TOTAL

111
Q

Can you prove a null hypothesis true?

A

NO.. We just fail to reject it.

112
Q

What is the other conditions for regression inference?

A

Random residuals (equal random scatter, no pattern).

113
Q

How can you decrease alpha and beta at the same time?

A

increase sample size. this will also increase power.

114
Q

interpret this 90% confidence interval for avg weight of mice (2.3, 3.5) in ounces

A

I am 90% confident that the mean weight of mice is between 2.3 and 3.5 ounces

115
Q

What is the “nearly normal” condition for proportions? (aka large enough sample)

A

np>10 and nq>10. actually Show this calculation

116
Q

IF you are testing to see if a marketing program increases sales, describe a type 1 and type 2 error.

A

Type 1: you think it increased sales but it didn’gType 2: It actually increased sales but you didn’t notice

117
Q

What do you have to look for with two sample procedures:

A
  1. You have to check the conditions for each sample AND2. The samples have to be independent from eachother
118
Q

Is a confidence interval a PROBABLILITY?

A

NO

119
Q

When do you know it is GOF test?

A

When you have ONE ROW or ONE COLUMN… then it gives you a ratio , like 1:2:5 or it gives you expected percents.

120
Q

What is the general formula for ALL CONFIDENCE INTERVALS?

A

STAT +/- CRIT SE

121
Q

notation: what is xbar- xbar

A

difference between two sample means

122
Q

what is df for chi squared homogeneity or independence?

A

(rows-1)(columns - 1) (remove a column and row.. Count boxes)

123
Q

What is the null model (the sampling distribution) in a 1-proportion Z test?

A

A pile of proportions (%) from a bunch of samples.

124
Q

How do you write conclusion if you fail to reject?

A

With a p-value this high (p-value > alpha) I fail to reject the null. There is not enough evidence to say that more students like eggs now.

125
Q

what is a test statistic?

A

a t or z score (or chi squared) that you use to find a p value

126
Q

WHat is the null model (the sampling distribution) in a paired T-Test?

A

A pile of average differences. Remember that in a paired test, you are getting an individual differenc from each pair of data, then finding the average of differences.

127
Q

What is the “ large enough sample” condition for Chi Square?

A

Make sure that there are at least 5 in each expected cell.

128
Q

One tail or 2 tailed? How do you tell?

A

if it just says “changed” or “different”.. Then it is 2 sided.. DOUBLE THE P VALUE!If it says “more” “less than” “greater” etc.. Then it is just one sided..

129
Q

notation: what is p

A

true population proportion (percent in the population)

130
Q

Can you accept a null hypothesis? Can you say “keep the null?”

A

Never accept a Ho, don’t keep the Null. simply “FAIL TO REJECT THE NULL”

131
Q

How wide is a confidence interval? (how many ME?)

A

It is 2 margins of error wide ALWAYS (DON’T CONFUSE WITH NUMBER OF SE)

132
Q

what happens to t models as n gets larger?

A

The models look more like the normal model. An infinite sample size would give a t model identical to the normal model.

133
Q

If you have one group and ask them one question, what type of chi-squared test is it?

A

Goodness of fit test.

134
Q

What does a “significance level of .02” mean?

A

set alpha= .02 and reject only p-values below that

135
Q

What do we compare our p value to in a hypothesis test?

A

compare it to alpha. if p value

136
Q

If you are doing a one tailed test with alpha=.05.. What confidence interval goes with that?

A

90% confidence interval tests a one tailed test. There is 5% in the tail.

137
Q

notation: what is phat - phat

A

difference between two sample proportions

138
Q

If you have multiple groups and ask them one question, what Chi-squared test is it?

A

Test for homogeneity.

139
Q

What is the Fundemental Theorem of Statistics?

A

The CLT!! The Central Limit Theorem!

140
Q

Are models what really happen?

A

No. A model train is not a real train. We use models to say what kind of happens.

141
Q

What is the “nearly normal” condition for regression? (aka large enough sample)

A

Make sure the histogram of the residuals is normalish.

142
Q

How do statistics from big samples compare to small? (notice this doesn’t ask about DATA)

A

Larger sample statistics have less variablility, so statistics from larger samples are closer to eachother and to the parameter. Statistics from smaller samples are more spread out, further away from true parameter.

143
Q

What is sampling variability?

A

same as sampling error. The natural variation of sample statistics.. NOT DATA.. Samples vary. so do their statistics.. Parameters do not vary!

144
Q

How is a confidence interval made?

A

statistic +- margin of error. Statistic +- (crit * s.d ). Stand at the statistic, reach out up and down a margin of error, and hope that you catch the parameter.

145
Q

What is a margin of error?

A

critical * s.d. It is how far you reach out in a confidence interval.. You reach up and down one of these, so the interval is actually 2 margins of error wide.

146
Q

What is the null for a paired T test?

A

xbar diff=0 (the average diff is zero)

147
Q

What is statistical inference?

A

Using a statistic to infer something about a parameter.. Basically, using a sample to say something about a population.

148
Q

CONDITIONS: What are the three conditions that you have to check with pretty much every inference procedure (every test and every interval)

A
  1. Randomly chosen sample (or assigned treatment). Circle the word random or explain why you think it is. 2. Sample size is less than 10% of the population. Show that 10n is less than N. Example, for 50 students, write 10(50)=500 is less than all students.3. Nearly normal (or large enough sample)- this differs based on the type of data and test.
149
Q

What is the null for a 2 prop Z?

A

p1=p2 OR… p1-p2=0, there is no diff

150
Q

notation: what is Ho

A

The NULL, the dull, the “things haven’t changed” hypothesis

151
Q

If the p-value is high, (above alpha), how do you write conclusion?

A

With a p-value this high(show p value < alpha) I fail to reject the null. There is not enough evidence to say that more students like eggs now.