ALL INFERENCE U5, U6 and U7 MIXED Flashcards
How do you find df in 2 samples?
EASY WAY: smaller sample size minus one.
this is a conservative guess
or the hard way
you have to run an interval or a test on your TI and read the output (unless you want to use the equation?.)
How can you decrease alpha and beta at the same time?
increase sample size.
this will also increase power
What if you want more cofidence with same size interval?
increase your sample size
For the following output for the association between test score and amount of time studied, What is the “5.82?”
OUTPUT: n=75 dep var: test score
VAR coeff se coeff t stat p
intercept 45.3 4.3 10.53 0.000
study time 5.82 2.66 2.189 0.0159
r-square: 77.5 S=7.7
That is the slope.
how are 2 samp t and paired t different?
2 samp t you are loooking at a difference between 2 averages from 2 distinct sample. With a paired test you make a SINGLE LIST OF DIFFERENCES (L3) from each pair, you then look at the AVERAGE DIFFERENCE, the average of a bunch of differences
(generally 2 measurements on just ONE sample).
How do you POOL with PROPORTIONS?
You combine the two samples into one big sample?
TOTAL # RED BEADS / OVERALL TOTAL OF ALL BEADS
What is statistical inference?
Using a statistic to infer something about a parameter.. Basically, using a sample to say something about a population.
what happens to t models as n gets larger?
The models look more like the normal model. An infinite sample size would turn a t model into the normal model.
For the following output for the association between test score and amount of time studied, Create and interpret a 95% confidence interval for slope.
OUTPUT: n=75 dep var: test score
VAR coeff se coeff t stat p
intercept 45.3 4.3 10.53 0.000
study time 5.82 2.66 2.189 0.0159
r-square: 77.5 S=7.7
STAT +- CRIT (SE)
5.82 +- CRIT (2.66)
crit is just INVT(.025, 73)
df is n-2 for regression
What is alpha?
It is the rejection threshold. You reject p-values below it.. It is how willing you are to make a Type 1 error.
alpa=P(Type I error)
For the following output for the association between test score and amount of time studied, interpret the S in context.
OUTPUT: n=75 dep var: test score
VAR coeff se coeff t stat p
intercept 45.3 4.3 10.53 0.000
study time 5.82 2.66 2.189 0.0159
r-square: 77.5 S=7.7
S is the standard deviation of the residual, or the typical residual. It is how far off we expect our actual data value to be from the model (from the predicted value). In context: We can expect our actual test score to be about 7.7 points off from the test score predicted by our model, based on the amount of time we studied.
What are the mean and standard deviation of a sampling distribution for a mean?
mean is mu and standard deviation is sigma/root n
(look at formula sheet) N(mu, sigma/rootn)
How do you calculate SAMPLE SIZE with proportions?
n = Z2 pq / ME2
what does 95% confidence mean when we make an interval?
It means if we took a ton of samples, and made confidence intervals from each of them, ABOUT 95% of the intervals would contain the parameter, 5% would not.
What is a confidence interval?
it is a parameter catcher.. Like a fishing net?. We stand at our statistic, and reach up and down a margin of error, WE ARE NOT IN THE MIDDLE OF THE PILE!!! and hope to CATCH the parameter? sometimes we do, sometimes we don’t? but we never know.. Mooo hooo hooo haaaa haaa haaa (evil laugh)
What are conditions for chi squared?
indep, rand, <10%, 5 or more in EXPECTED cells
THINK OF Type 2 error?
“MISSED OPPORTUNITY” “YOU ARE SICK, BUT WE MISSED IT” “THE PROGRAM WORKED, BUT WE DIDN’T NOTICE”
What is effect size?
difference between null and true parameter.
something we don’t know
What are the conditions that have to be met in order to use a normal model for the distribution of sample proportions? (sampling distribution of proportions).. (the distribution of p-hats)..
- Randomization (this helps with assumption of independence
- SMALL ENOUGH SAMPLE … 10% condition (this is the upper limit of our sample size. above this, the sampling distribution starts looking leptokurtic (thinner and taller), not normal)
- LARGE ENOUGH SAMPLE.. success/failure: np and nq > 10. this is the lower limit of our sample size. It is when the sampling distribution starts looking normal.
Where did the s.d. of differences of proportions that is on the formula sheet come from?
From the square root of the added variances of the the sampling distributions of the 2 proportions?
How do you calcuate SAMPLE SIZE with means?
n= (t*s/ME)2
Often, z is fine.. but
First calculation, use Z crit.. Then go through and calculate n.. Use that n for a t crit and do it again.
what is difference between assumptions and conditions?
Assumptions must be made in order to perform inference. We need to assume independent sample values and a large enough sample (but not too large).
We check conditions to help support our assumptions.
When do you know it is GOF test?
When you have ONE ROW or ONE COLUMN.
then it gives you a ratio , like 1:2:5
or it gives you expected percents.
Describe the distribution of a sample
It will look like the population. The distribution of a sample is a histogram made from the sample, which will look kind of like the population. If the population is bimodal, then the distribution of the sample is bimodal. The SAMPLING distribution of a bunch of means, however, will look normalish.
With regression computer output, how do you find the p-value for hypothesis test with null: slope=0
p value is given at the end of the row that the slope is in!
It is the SLOPE/SE (because the t is [slope - 0)/ SE]
THINK OF Power?
ability to detect change, or to detect what test was designed to detect.
What is the Fundemental Theorem of Statistics?
The CLT!!
The Central Limit Theorem!
pile of stats surrounds the parameter and is normalish with large enough sample size!!
For the following output for the association between test score and amount of time studied, interpret the r-squared in context.
OUTPUT: n=75 dep var: test score
VAR coeff se coeff t stat p
intercept 45.3 4.3 10.53 0.000
study time 5.82 2.66 2.189 0.0159
r-square: 77.5 S=7.7
77.5% of the variability in test score can be explained by the model with hours studied.
How else can you explain power?
The likelihood you correctly reject a false null
When do you use 1 prop z test instead of one prop t test?
There is no 1 prop t test. You use Z for props. T for means.
What is a margin of error?
critical * s.d..
It is how far you reach out in a confidence interval..
You reach up AND YOU REACH DOWN one of these,
so the interval is actually 2 margins of error wide.
Your confidence interval is (.25, .35). What is your point estimate?
0.3
(UB+LB) / 2,
avg of the numbers,
in the middle
POWER + BETA =
1
How can you increase power?
Increase alpha
increase sample size..
Can you decrease alpha while increasing power (even though they move together?)..
Yes.. They move together with constant sample size. If you increase the sample size, you can decrease alpha and increase the power.
Are models what really happens?
No. A model train is not a real train. We use models to say what kind of happens.
If you are testing to see if more students use tobacco now, and you find that was enough evidence to say that more do, but actually, there was not an increase, what type of error did you make?
Type 1 error
How are power and alpha related?
they go up and down together
If the null is true, what is the only error you could make?
Type 1
How do you write conclusion if you fail to reject?
With a p-value this high. I fail to reject the null. (I retain the null). There is not enough evidence to say that more students like eggs now.
Do alpha and beta work with means?
Yep, alpha, beta, power, type 1, type 2 all go along with means and proportions AND regression AND CHI SQUARED!!
What are the mean and standard deviation of a sampling distribution for a proportion?
mean is p and sdandard deviation is root pq/n
(look at formula sheet)
N(p, root (pq/n) )
When we are looking at differences of proportions, what is the sampling distribution a distribution of?
You have to imagine taking a a pair of samples, say.. Of girls and boys, subtracting phat girl-phat boy, and then writing that difference down. Do this over and over again, and you will have a list of differences. Now make a histogram of that list of differences, and that is your sampling distribution. It is an imagined distribution of an infinite amount of differences (of sample pairs)..
What is a 2 sample t interval?
Well.. Suppose you were trying to find the difference between the IQ of math teachers and IQ of English teachers. You sample 50 of each and find math xbar= 125 and English xbar=115. So, the difference of the samples is 10 points. That is your statistic. 10 points. You now have to add on a margin of error, let’s say.. 4. so, youll say something like “I’m 90 % confident that math teachers score between 6 and 14 points higher on IQ tests.”
For the following output for the association between test score and amount of time studied, what is the equation of the LSRL?
OUTPUT: n=75 dep var: test score
VAR coeff se coeff t stat p
intercept 45.3 4.3 10.53 0.000
study time 5.82 2.66 2.189 0.0159
r-square: 77.5 S=7.7
Score HAT = 45.3 + 5.82 (hours studied)
What did Bill Gossett Do?
He sat on the Normal model and drank some tea. (t model looks like someone sat on the normal model)
What does CLT say about the distribution of the population?
Not much
just that it doesn’t matter what it is..
With large samples..
The SAMPLING dist will be approx normal
(dist of stats.. NOT DATA)
Do you pool with means (t test)?
No you don’t have to do it. Only pool with hyp tests for props. Pooling with means is a nasty process, if you think the populations have similar variances, then have your calculator pool.
what are the conditions for chi squared?
counts, five or more in each expected, independent (random), <10%
THINK OF Type 1 error?
“BUT I THOUGHT THINGS CHANGED” or “BUT I THOUGHT IT WORK” or “BUT I THOUGHT YOU WERE SICK”
Do you use p-hat or p-null when you calculate your standard deviation?
use p-null..
when do you need crits?
in confidence intervals
(and old fashioned hyp tests)
What are the conditions that have to be met in order to use a t-model for the distribution of sample means? (sampling distribution of means).. (the distribution of x-bars)..
- Random and independence
- Not too large, less than 10% of population,
so 10n<n></n>
<p>3. NOT TO SMALL n>30</p>
<p>(if n<30 must be normalish)</p>
</n>
how do you find expected count if n=25 for a 1:3:1 ratio? What test is it?
GOODNESS OF FIT
find total: 1+3+1 = 5 divide all by five and that gives expected percents
1/5 : 3/5 : 1/5 .20 : .60 :.20
now multiply each by n and get expected counts.
Almost always not a whole number. 25(.20) : 25(.60) :25(.20)
How to find expected cell count on a matrix?
ROW TOTAL* COLUMN TOTAL/ OVERALL TOTAL
How do you make a confidence interval with computer output?
STAT +/- CRIT SE
The STAT and SE are given side by side
the t crit is stilll INVT(area 1 tail, n-2),
Just put the +/- t crit between the actual slope and the given std. error. The calculation is simple
How do statistics from big samples compare to small?
Larger sample statistics have less variablility, so statistics from them are closer to the parameter and eachother (sampling distribution has smaller standard error). Statistics from smaller samples are more likely to be far away from true parameter.
This is why increasing sample size increases power, it narrows the piles of statistics.
how do you find deg freedom?
n-1 for one sample
2 samples you must use calculator..
REGRESSION IS n-2
GOF is cells - 1
indep and homog is (r-1)(c-1)
who invented the t model?
Bill Gosset, guiness brewing company.
What is the quick sample size calculation for proportions?
1/ME2
For the following output for the association between test score and amount of time studied, Test a hypothesis to see if there is a significant association between time studied and test score.
OUTPUT: n=75 dep var: test score
VAR coeff se coeff t stat p
intercept 45.3 4.3 10.53 0.000
study time 5.82 2.66 2.189 0.0159
r-square: 77.5 S=7.7
Ho: B1 = 0
Ha: B1 not= 0
(imagine a pile of sample slopes centered at 0 with SE of
BE SURE THEY ARE BETAS (Greek B’s) small b is a sample slope, Beta is population slope
P value is given: 0.0159, so interpret it !
What is diff between homogeneity and test for independence?
homogeneity is more than one sample and asking about one variable, independence is just one sample with two variables.
Your confidence interval is (.25, .35). What is your margin of error?
0.05
What is “statistically significant?”
When p-value is below the alpha, we say “statistically significant”.. Low p-values are statistically significant. When our sample most likely didn’t happen randomly, that is statistically significant.
how are alpha and beta related?
as one increases, the other decreases, and vice versa
Simple quick way to describe difference between paired T and 2 sample T.
Paired T looks at average of differences, 2 sampe T looks at the difference of averages.
what is df for goodness of fit?
cells - 1
Can you draw the alpha/beta/power diagram?
See page 486. Be able to draw and label this the way we do in class with the box and “RETAIN REJECT” up top and “Ho TRUE, Ho FALSE” on left.
How do you write conclusion if you reject?
With such a low p-value, I reject the null hypothesis. There is strong evidence that the proportion of students who eat rice has changed.
what is a statistic
some numerical summary of a sample.. Could be the mean of a sample, the standard deviation of a sample, the proportion of successes in a sample, the slope calculated from a sample, a difference of 2 means from 2 samples, a difference of 2 proportions from 2 samples, a difference of 2 slopes from 2 samples.. you can make sampling distributions for any of these, and they will all be centered around the parameter…
What does the CLT say about the distribution of actual sample data?
Nothing? The sample will be distributed similar to the population. The CLT only talks about distributions (histograms) of sample statistics, which are groups of means.., NOT OF INDIVIDUALS!!!! NOT DATA
What is power?
The probability that you correctly rejected a false null
What does Central Limit Theorem Say?
It basically says.. NO MATTER WHAT SHAPE THE POPULATION IS (normal, bimodal, uniform, skewed, crazy.. )? If you make a histogram of a bunch of means taken from a bunch of samples, that histogram will be unimodal and symmetric WITH LARGE ENOUGH SAMPLES.. Close to normal. So.. A nerdy way to say it is: The sampling distribution of means is approximately normal no matter what the population is shaped like. The larger the sample size, the closer to normal. (the normal curve is just a model.. the sampling distribution is close to it, but not it! we use the model anyway!)
What is a point estimate?
Your statistic. You stand at the point estimate and reach up and down to make an interval
For the following output for the association between test score and amount of time studied, Where does the “p” column come from?
OUTPUT: n=75 dep var: test score
VAR coeff se coeff t stat p
intercept 45.3 4.3 10.53 0.000
study time 5.82 2.66 2.189 0.0159
r-square: 77.5 S=7.7
r-square: 77.5 S=7.7
The p is the p value from a hypothesis test assuming the y intercept and the slope are zero. If you put those T-STATS into tcdf, you get the p values from this column.
What are the three chi-squared models?
goodness of fit, test for homogeneity, test for independence
How do you find degrees of freedom for 2 sample mean stuff?
MUST USE CALCULATOR
WHAT EQUATION HAS INTERVAL WIDTH, Z CRIT and SE IN IT?
WIDTH = 2 (Z) (SE)
how are beta and power related
as one increases, the other decreases, and vice versa? They have to because they BOTH ADD TO ONE!!! Power + Beta = 1
Where did the s.d. of differences ofmeans that is on the formula sheet come from?
Combining models. From the square root of the added variances of the the sampling distributions of the 2 means
What is the common misconception about confidence intervals?
They know you stand at your stat and reach up and down? BUT?. People think that their statistic is in the middle of the pile of p hats or x bars? in reality, they are almost definitely NOT? they are out on one of the sides, and they are reaching up and down and trying to catch the center!!! YOU ARE NOT IN THE MIDDLE OF THE PILE!!
For the following output for the association between test score and amount of time studied, interpret the y intercept in context.
OUTPUT: n=75 dep var: test score
VAR coeff se coeff t stat p
intercept 45.3 4.3 10.53 0.000
study time 5.82 2.66 2.189 0.0159
r-square: 77.5 S=7.7
r-square: 77.5 S=7.7
The model predicts that a person who doesn’t study at all will score about a 45.3 on the test.
How do you POOL with MEANS?
YOU DON’T
Can you prove a null hypothesis true?
NO
How is a confidence interval made?
statistic +- margin of error ??.. Statistic +- (crit * s.d )??. Stand at the statistic, reach out a margin of error, and hope that you catch the parameter.
what is a parameter?
some numerical summary of a population. Often called “the parameter of interest.” It is what we are often trying to find.. It doesn’t vary. It is out there and STUCK at some value, it is the truth, and you’ll probably not ever know it! We try to catch them in our confidence intervals, but sometimes we don’t (and we don’t know it!). It Could be the mean of a population, the standard deviation of a population, the proportion of successes in a population, the slope calculated from a population, a difference of 2 means from 2 population, a difference of 2 proportions from population
What are the conditions that have to be met in order to use a normal model for the distribution of sample proportions? (sampling distribution of proportions).. (the distribution of p-hats)..
- Random and independence 2. Not too large, less than 10% of population, so 10n 10. . Too large samples have sampling distributions that are leptokurtic (narrower and taller than normal model), too small samples have skewed or other shaped sampling disributions.
how do you find z and t crit?
for z crit.. INVNORM(area in 1 tail)? for t crit?. INVT(area in 1 tail, deg freedom)
One tail or 2 tailed? How do you tell?
if it just says “changed” or “different”.. Then it is 2 sided.. DOUBLE THE P VALUE!If it says “more” “less than” “greater” etc.. Then it is just one sided..
For the following output for the association between test score and amount of time studied, where does the “t stat” column info come from?
OUTPUT: n=75 dep var: test score
VAR coeff se coeff t stat p
intercept 45.3 4.3 10.53 0.000
study time 5.82 2.66 2.189 0.0159
r-square: 77.5 S=7.7
r-square: 77.5 S=7.7
T stat is the T score, the test statistic for null=0:
(STAT-NULL)/SE (45.3 - 0) / 4.3 = 10.53
(5.82-0) / 2.66 = 2.189
how can you decide the right test? What are the 3 questions?
1 or 2 samples? Proportions (z) or Means (t)? Test or Interval?
For the following output for the association between test score and amount of time studied, what is the correlation coefficient?
OUTPUT: n=75 dep var: test score
VAR coeff se coeff t stat p
intercept 45.3 4.3 10.53 0.000
study time 5.82 2.66 2.189 0.0159
r-square: 77.5 S=7.7
r-square: 77.5 S=7.7
if r squared is .775, then you have to take sqrt of that.. So r= 0.8803
how are t models like Normal models?
both are unimodal and symmetric. T models aren’t as high and have more area in tails, that?s why you have to reach out a little further than z for same confidence. Bill sat on the normal model.
what are the conditions for inference for slope? (hyp test or confidence interval for slope?)
straight enough (check residuals for random scatter), random and independent, and look at the HISTOGRAM OF THE RESIDUALS and make sure they are unimodal and symmetric
what is df for chi squared homogeneity or independence?
(rows-1)(columns - 1) (remove a row and a colunmn an count the cells that are left)
For the following output for the association between test score and amount of time studied, Interpret the slope in context.
OUTPUT: n=75 dep var: test score
VAR coeff se coeff t stat p
intercept 45.3 4.3 10.53 0.000
study time 5.82 2.66 2.189 0.0159
r-square: 77.5 S=7.7
On average, for each hour a student studied more than another student, their test scores were about 5.82 points higher.
what is a test statistic?
a t or z score (or chi squared) that you use to find a p value
What is sampling variability?
The natural variation of sample statistics.. NOT DATA.. Samples vary? so do their statistics.. Parameters do not vary!
Where did Bill Gossett work?
GUINNESS
Is a confidence interval a PROBABLILTY?
NO
xbar and mu in t-test?
xbar is your sample mean, mu is yor hypothesized population mean
How do you find Expected Count?
for GOF: Exp %(total).. For indep and homog: ROW*COL/TOTAL
With regression computer output, how is the t-ratio and the p-value calculated?
T ratio is just SLOPE/ST ERROR and the p value is just TCDF(T ratio, 9999, n-2)
What is advantage of pooling?
Pooling allows you to increase your sample size? sort of,
If you are testing to see if more students use tobacco now, and you find that there was not enough evidence to say that more do, even though more actually do now, what type of error did you make?
Type 2 error
Your confidence interval is (.25, .35). What is your standard error?
it depends on the critical value.
what are the conditions that have to be met for t procedures with small samples?
independent groups, random, <10% of pop and nearly normal.
What is a critical value?
It is the amount of standard deviations (errors) you’ll reach out, depending on your confidence (a t or z). Example.. 68% crit z = 1 ?.. For 95% crit z = 2 (well, 1.96).. For means.. Use t crits
What is a standard error?
typical distance a statistic is from the parameter. The avereage distance to the middle in a sampling distribution. Called standard error because it is the typical error you would espect in a sample.
If the null is false, what is the only error you could make?
Type 2
What are the 3 steps in hypothesis testing AFTER YOU CHECK CONDITIONS?
- Make your Ho and Ha2. Make a Null Model (centered at null, use your Ho as center and in calculations, use your sample size).. This is a sampling distribution for the statistics if the null were true.3. CHECK? Calculate your statistic (p-hat, x-bar, phat1-phat2, xbar1-xbar2)
Do you use p-hat or p-null when you check the success/failure condition?
use p null
Your confidence interval is (.25, .35). What is your statistic?
0.3 (UB+LB) / 2, avg of the numbers, in the middle
If you fail to reject, what is the only type of error you could make?
Type 2
What is sampling error?
same as sampling variability.. The natural variability between STATISTICS.. NOT DATA!!! . We call it error EVEN THOUGH YOU MADE NO MISTAKES!!!
What is beta?
It is probability that you’ll make a Type II error.. P(Type II error)
If you reject, what is the only type of error you could make?
Type 1
when is data “paired”
when you have 2 measurements on the same subject (or matched subjects). Often Before-After
What is a 2 sample t interval?
Well.. Suppose you were trying to find the difference between the IQ of math teachers and IQ of English teachers. You sample 50 of each and find math xbar= 125 and English xbar=115. So, the difference of the samples is 10 points. That is your statistic. 10 points. You now have to add on a margin of error, let’s say.. 4. so, youll say something like “I’m 90 % confident that math teachers score between 6 and 14 points higher on IQ tests.”
What are we confident in?
our confidence lies in our interval? if we took another sample.. We’d have a different interval..
when do you have to check conditions?
In all inference procedures: ANY CONFIDENCE INTERVAL OR HYPOTHESIS TEST (including chi squared and slope stuff)
What are conficence intervals for?
They are an attempt to say what the true population parameter is.. It is our best guess? “We think that there will be between 6 and 12 inches of snow?”
N ( ?1 , ?2 ) what does this mean?
it means NORMAL models centered at ?1 With a standard deviation of ?2
what is error?
distance from statistic to parameter, how far you sample statistic is off from the truth.
What is a p-value
It is the probability of getting your sample randomly if the null were true. Basically, how likely is it that your sample statistic came from the Null Model.
Why does the book use ybar instead of xbar?
I don’t know
Will 95% of other statistics be within my interval?
NO!!! You have no idea where your interval is in regards to true parameter
Can you make a 100% confidence interval?
Sure, I’m 100% confident that it will snow between 0 and 500 feet tomorrow?
Can you accept a null hypothesis?
Never accept a Ho
If you were going to pool with means (t), which you probably won’t have to do, when would you?
When you have reason to believe the variances of both populations are equal.
What is the difference between the distribution of a sample and a sampling distribution?
A distribution of a sample is just a histogram of the DATA in a sample. A sampling distribution is made from an bunch of sample STATISTICS. It is the distribution of the statistic that was calculated from those many many samples.
What is difference between population of interest and parameter of interest?
Population is the subjects you are interested in? Parameter is the actual number you want (like % of or AVG)
How wide is a confidence interval?
It is 2 margins of error wide
What is a t-crit?
It is the same as z crit. It is the number of sd you reach out in your CI. To find it, do INVT(area in one tail, degrees of freedom)
For the following output for the association between test score and amount of time studied, make a 90% confidence interval for the predicted score for someone who studied for 8 hours.
OUTPUT: n=75 dep var: test score
VAR coeff se coeff t stat p
intercept 45.3 4.3 10.53 0.000
study time 5.82 2.66 2.189 0.0159
r-square: 77.5 S=7.7
To make an interval for prediction, plug 8 into equation..
45.3 + 5.82 ( 8 ) = 91.86
stand there and go up and down CRIT S
Use S because it is the st dev of resid.
STAT +- CRIT (SE)
91.86 +_ Tcrit (7.7)
Tcrit is invt(.05, 73)
how do you check nearly normal for small samples?
Histogram on calculator, or normal prob plot on calculator, .OR Boxplot should be symmetrical?
What is the normal enough condition?
for smaller sample sizes, it must be plausible that the sample may have kind of come from a normalish population.
Do parameters vary?
NO!!! Statistics do? they vary from sample to sample? PARAMETERS DO NOT VARY!
What if you want more confidence?
get a bigger net.. (wider conficence interval) or increase sample size
What is a null model?
It is a sampling distribution. It tells us how sample statistics would vary if the null were true. It is centered at the null.
Why does increasing n increase power?
When you increase n, the pile of statistics gets more narrow, the statistics get closer to the true mean. Imagine your alpha beta power diagram with narrower piles, but the same distance from eachother, the power would increase!