Midterm 2 Flashcards

Question

inference testing

Answer 1

inference -- drawing conclusions about a pop. (parameter) based on a sample (statistic) with a measure of uncertainty --everything we've learned so far is to ensure valid inference --making generalizations about the population based on sample data; steps: 1. producing data from pop. 2. exploratory data analysis (data from sample) 3. probability 4. inference about pop.

Answer 2

1. POINT ESTIMATION --quantitative data ex. based on a sample of n = 47 policies, we estimate that the ave. premium is approx. $1800 --categorical data ex. based on a sample of n = 144 households, we estimate that the proportion of infected bamboo is approx. 10.4%

Answer 3

2. INTERVAL ESTIMATION --quant. data ex. based on sample of n = 47 policies, we estimate that the ave. premium is btw $1700 and $1900 --categorical ex. based on sample of n = 144 households, we estimate that the proportion of infected bamboo is btw 8.4% and 12.4%

Answer 4

3. HYPOTHESIS TESTING --Quant. ex. insurance agent believes that the ave. premium at her agency is $2500. Based on the sample n = 47 claims, we found that the ave. premium was $1800. this data, therefore, provides evidence that the ave. premium is less than $2500. That is, x bar = $1800 is an outcome that would rarely happen if the ave. was indeed $2500. --categorical ex. researchers believe that the proportion of infected bamboo is 2%. Based on a sample of n = 144 households, found that prop. was 10.4% This data provides evidence that the prop. is > 2%

Answer 5

ESTIMATOR --general statistic that estimates the parameter --ex. the estimator of the pop. mean MU is the sample mean x bar --the estimator of the pop. proportion p is the sample proportion p-hat; ESTIMATE --a specific value of an estimator --ex. the ave. value for n = 47 is $1800 --ex. the prop. of infected boards for n = 144 is 10.4%

Answer 6

A.) sampling needs to be done RANDOMLY B.) sampling distribution of x bar tells us: 1. on ave. x bar will give us the right answer -- this property is called UNBIASEDNESS 2. as sample size n INC., the accuracy of x bar INC. (smaller st. dev.)
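
A minimal sketch of point 2, assuming the usual formula st. dev. of x bar = sigma / radical n. The value sigma = 14 is borrowed from the blood pressure card; the function name and the sample sizes are illustrative only.

```python
import math

def sd_of_sample_mean(sigma, n):
    """St. dev. of the sampling distribution of x bar: sigma / sqrt(n)."""
    return sigma / math.sqrt(n)

# Quadrupling n cuts the spread of x bar in half.
spreads = {n: sd_of_sample_mean(14.0, n) for n in (25, 100, 400)}
for n, spread in spreads.items():
    print(n, spread)
```

The spread shrinks with radical n, not n, so halving the st. dev. takes four times the data.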

Answer 7

1. CONFIDENCE INTERVAL: point estimate +/- ME 2. TEST OF SIGNIFICANCE: reject or fail to reject Ho at level alpha; both give: --conclusion about parameter --measure of uncertainty LOOK AT CHART - CH. 17

Answer 8

STATE --Specify parameter of interest PLAN --choose procedure, level of confidence SOLVE --check conditions, carry out procedure CONCLUDE --interpret confidence interval LOOK AT MY WRITING ASSIGNMENT

Answer 9

STATE --Specify claims about parameters of interest PLAN --choose procedure, specify Ho, Ha, alpha SOLVE --check conditions, calc. test statistic and p-value CONCLUDE --compare p-value to alpha, interpret test results

Answer 10

Hypothesis testing

Answer 11

--conclusion about MU --gather data using SRS --sigma known (rarely known for pop.) --sampling distribution of x bar is normal bc. 1. pop. is normal or 2. large sample, due to CLT; construct 95% confidence interval --plan to gather a random sample of 100 students; x bar estimates mean systolic blood pressure of all students; parameter: mean systolic blood pressure of all students --sample size: n = 100 --st. dev.: sigma = 14.0 mm Hg; how close should x bar be to MU? if x bar has a normal distribution, then by the 68-95-99.7 rule we can say: --the prob. that x bar is w/in 2 st. dev. (2*14/radical 100 = 2.8) of MU is .95 OR we are 95% confident that MU is within 2*(14/radical 100) of x bar, i.e. that the interval x bar +/- 2*sigma/radical n contains MU; CAN'T SAY "PROBABILITY" WHEN TALKING ABOUT MU, so use "confident" ``` ex. n = 100, x bar = 123.4, sigma = 14: confidence interval can be written as (120.6, 126.2) OR (120.6 - 126.2) OR (120.6 to 126.2) ```
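
The arithmetic in the example on this card can be checked with a short sketch. It uses the rounded multiplier 2 from the 68-95-99.7 rule, as the card does; the function name is illustrative.

```python
import math

def z_interval(x_bar, sigma, n, z_star=2.0):
    """Return (low, high) for x bar +/- z_star * sigma / sqrt(n)."""
    margin = z_star * sigma / math.sqrt(n)
    return x_bar - margin, x_bar + margin

# Numbers from the card: n = 100, x bar = 123.4, sigma = 14.
low, high = z_interval(x_bar=123.4, sigma=14.0, n=100)
print(f"({low:.1f}, {high:.1f})")  # (120.6, 126.2)
```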

Answer 12

x bar +/- 1.96*sigma/radical n; the 1.96 comes from the table of st. normal prob. and corresponds to the MIDDLE 95% of the normal distribution; use the confidence level to find z* and the C% confidence int. for MU --general formula for C% conf. int.: x bar +/- z*(sigma/radical n); z* = critical value for the confidence level you want; z* is found at bottom of TABLE C
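
As a cross-check on the table lookup, z* can also be computed. This is a sketch using Python's standard-library `statistics.NormalDist`; the helper name `z_star` is made up for illustration.

```python
from statistics import NormalDist

def z_star(confidence):
    """Critical value leaving the MIDDLE `confidence` area under the st. normal curve."""
    tail = (1 - confidence) / 2          # area in each tail
    return NormalDist().inv_cdf(1 - tail)

for c in (0.90, 0.95, 0.99):
    print(c, round(z_star(c), 3))
```

For 95% this gives about 1.96, the same multiplier used on the card.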

Answer 13

--we replace sigma with s --we replace z* with t*: x bar +/- t* s/radical n; t* = critical value for the level of confidence you want; IF data gathered using SRS --sigma unknown --normality of pop. distribution or large sample size, then: sampling distribution of (x bar - MU)/(s/radical n) has a Student's t-distrib. --WITH n - 1 degrees of freedom (DF)

Answer 14

- -symmetric, bell-shaped, mean = 0 - -the smaller the DF the larger the spread (bc more uncertainty due to s) - -the larger the DF, the closer the t-distrib. to the standard normal

Answer 15

1. each t distribution is determined by its DF: df = n - 1 2. if actual DF is not on table, use DF closest to actual without going over 3. t* values are found in body of TABLE C

Answer 16

1. STATE problem 2. PLAN --select procedure: one sample t CI for means --select confidence level --select parameter of interest in context 3. SOLVE --collect and plot data --calc. x bar and s --check conditions (randomness and normality of pop. distribution / large sample size w/ no outliers) --calc. confidence interval using formula: x bar +/- t* (s/radical n) ``` 4. CONCLUDE interpret CI in context including --confidence level --parameter of interest --calculated interval ```

Answer 17

1. STATE: students at a large church activity surveyed 121 married students to estimate w/ 90% confidence how long these students had dated on ave. before getting engaged 2. PLAN: use "one sample t CI for MU" --confidence level: 90% --parameter of interest: mean # of months married students dated before engagement 3. SOLVE: collected SRS of 121 married students, s = 10, x bar = 9.34 --conditions: random (SRS) --normal or large sample size, n = 121 (1st CHECK IF CONDITIONS MET; IF THEY ARE, CAN CONT. AND FIND CI) 9.34 +/- (1.66)*(10/radical 121) = (7.83, 10.85) 4. CONCLUDE: --we are 90% confident (NOT PROBABILITY) that the interval (7.8 mo, 10.8 mo) contains the TRUE MEAN # of mo. married students at this activity dated before getting engaged
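
A sketch of the SOLVE step, taking x bar = 9.34, s = 10, n = 121 and the table value t* = 1.66 from the card. The t* is passed in rather than computed because Python's standard library has no t distribution; the function name is illustrative.

```python
import math

def t_interval(x_bar, s, n, t_star):
    """One-sample t interval: x bar +/- t_star * s / sqrt(n)."""
    standard_error = s / math.sqrt(n)
    margin = t_star * standard_error
    return x_bar - margin, x_bar + margin

# t_star = 1.66 is read from the t table, as on the card.
low, high = t_interval(x_bar=9.34, s=10, n=121, t_star=1.66)
print(f"({low:.2f}, {high:.2f})")  # (7.83, 10.85)
```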

Answer 18

in repeated sampling, the confidence level is the percentage of confidence intervals produced by the procedure that actually contain the value MU, i.e. the success rate of the procedure; ex. 95% of all possible 95% conf. interval estimates for MU actually contain the value of MU --the confidence is based on the PROCEDURE, not the INTERVAL
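
The "success rate of the procedure" idea can be illustrated by simulation. This is a sketch under assumed values: a normal pop. with MU = 120 and sigma = 14, samples of n = 25, and 2000 repetitions are all made up for the demonstration.

```python
import math
import random

def coverage(mu, sigma, n, z_star, trials, seed=0):
    """Fraction of simulated z intervals (sigma known) that contain mu."""
    rng = random.Random(seed)
    margin = z_star * sigma / math.sqrt(n)
    hits = 0
    for _ in range(trials):
        # Draw one sample and compute its mean.
        x_bar = sum(rng.gauss(mu, sigma) for _ in range(n)) / n
        if x_bar - margin <= mu <= x_bar + margin:
            hits += 1
    return hits / trials

rate = coverage(mu=120.0, sigma=14.0, n=25, z_star=1.96, trials=2000)
print(rate)  # should land close to 0.95
```

Each single interval either contains MU or it does not; the 95% describes how often the procedure succeeds over many samples.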

Answer 19

false Suppose a sample of size 250 was taken instead of size 100. How will the margin of error change? --margin of error would DECREASE

Answer 20

t* s/radical n (or z* sigma/radical n when sigma is known); MARGIN OF ERROR DOES NOT INCLUDE THE SAMPLE MEAN, just the second part of the equation after +/-

Answer 21

standard error of x-bar

Answer 22

x bar +/- z* sigma/radical n --x bar = point estimator --z* = confidence multiplier --sigma/radical n = st. dev. of pt. estimator --everything RIGHT of +/- is MARGIN OF ERROR (m); as sample size INC, margin of error DEC. --as n INC, width of confidence interval DEC. --as confidence level INC, margin of error INC; also can be written as x bar +/- m.

Answer 23

1. margin of error (m) controls the width of the interval 2. as sample size INC, m and width DEC 3. as conf. INC, m and width INC; m = (z* sigma)/radical n; ex. sigma = .25, m must be no larger than .05 with 99% confidence --how many times should pH be measured? n = ((z* sigma)/m)^2 = ((2.576*.25)/.05)^2 = 165.89, so n = 166 --ALWAYS ROUND UP TO NEXT INTEGER (even 165.1 --> 166).
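
A sketch of the sample size calculation on this card. It assumes z* = 2.576 for 99% confidence (the standard table value); the function name is illustrative.

```python
import math

def required_n(z_star, sigma, m):
    """Smallest n with margin of error at most m: round (z* sigma / m)^2 UP."""
    return math.ceil((z_star * sigma / m) ** 2)

# Card example: sigma = .25, m = .05, 99% confidence.
n = required_n(z_star=2.576, sigma=0.25, m=0.05)
print(n)  # 166
```

`math.ceil` is used instead of `round` because rounding down would leave the margin of error slightly larger than allowed.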

Answer 24

(.21 + 2.45)/2 = 1.33; 1.33 - .21 = 1.12; 2.45 - 1.33 = 1.12; M = 1.12!!! --margin of error is the distance from lower bound to mean and from upper bound to mean
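
The same recovery of the center and margin of error from the interval endpoints, as a small sketch (function name illustrative):

```python
def center_and_margin(low, high):
    """Recover the point estimate and margin of error from a CI (low, high)."""
    center = (low + high) / 2   # midpoint = x bar
    margin = (high - low) / 2   # half the width = m
    return center, margin

center, margin = center_and_margin(0.21, 2.45)
print(f"{center:.2f} {margin:.2f}")  # 1.33 1.12
```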

Answer 25

= s / radical n --not just S!!!!! S/ radical n = estimates the st. dev. of the sampling distribution of x bar

Answer 26

the value of a population parameter

Answer 27

t distrib. has the same center but is more spread out than st. normal distribution

Answer 28

plausible values that a parameter could take | --NOT a measure of the confidence we can have in our sample results representing the population

Answer 29

1. draw conclusion about PARAMETER using STATISTIC 2. WITH a measure of uncertainty; two types: 1. confidence interval = estimates a parameter 2. test of significance = hypothesis testing = assesses a claim about a parameter; "significantly reduced" = mean score was TRULY reduced --can show a claim is NOT true but can't prove it is true --if we fail to disprove a claim, we can't say it is true, only that we lack evidence against it; ALWAYS assume the claim that researchers think is NOT true --approach called PROOF by CONTRADICTION (assume claim is true; if the data would be rare under it, conclude it is false) --researchers think claim is FALSE so they assume it is TRUE

Answer 30

1. CLAIM 1: (Ho) - null hypothesis - always stated as ___ = ___; CLAIM 2: (Ha) - alternative hypothesis - always ___ <, >, or not = ___ 2. OUTCOME - standardized outcome that measures how far the sample result diverges from CLAIM 1 --outcome represented by standardized stat. called a "test stat." 3. ASSESSMENT OF EVIDENCE --how likely is it to get this outcome if claim 1 is true? --p-value - prob. of getting something this far out in the tail 4. CONCLUSION --an outcome that would rarely happen if claim 1 is true is good evidence that claim 1 is not true, hence we believe claim 2 is true

Answer 31

related measure of uncertainty is alpha, which represents the prob. of falsely rejecting claim 1 (Ho): alpha defines what is a rare or unlikely outcome; a test with > or < in Ha is a ONE-SIDED TEST; a test with NOT = in Ha is a TWO-SIDED TEST

Answer 32

number that summarizes data for a test of significance --compares an estimate of parameter from sample data with value of parameter given in null hypothesis --measures how far sample data diverge from Ho --large values are not consistent with Ho: evidence against Ho --used to find prob. of obtaining sample data IF Ho were true --ex. of test statistic: t = (x bar - Mo)/(s/radical n)

Answer 33

a # btw 0 and 1: 0 <= P-val. <= 1 --the prob. of getting a test stat as extreme or more extreme than observed IF Ho were true --measure of strength of agreement btw observed test stat and Ho --measures STRENGTH OF EVIDENCE AGAINST Ho --P-VAL IS NOT THE PROBABILITY THAT Ho IS TRUE!! it is computed ASSUMING Ho is true; LOW P-VAL MEANS SHOULD REJECT; P-val. close to 0 = good evidence Ho is not true = evidence for Ha; P-val. large (.5 up to 1) = no evidence that Ho is not true = not evidence for Ha

Answer 34

artificial but imp. - sharp boundary btw rejection and non-rejection regions for p-value --if p-val. <= alpha - diff. is statistically significant, reject Ho and conclude it as false ALPHA = ASSUME .05 UNLESS SPECIFIED IN PROBLEM

Answer 35

P-val. <= alpha: REJECT Ho --declare observed diff. statistically significant --conclusion: believe Ha --diff. btw claimed parameter value and calculated statistic likely real, not chance --stat. tests do not address issue of importance ("practical significance"); P-val. > alpha: do not reject Ho --do not declare statistically significant --insufficient evidence to believe Ha --don't accept Ho, but fail to reject Ho --fail to reject Ho means the diff. could be due to chance, not real
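
The decision rule is mechanical enough to write as a tiny sketch. The function name and return strings are illustrative; the p-value .0367 is borrowed from the hemoglobin card later in the deck.

```python
def decide(p_value, alpha=0.05):
    """Compare p-value to alpha; never returns 'accept Ho'."""
    if p_value <= alpha:
        return "reject Ho"
    return "fail to reject Ho"

print(decide(0.0367))              # reject Ho
print(decide(0.0367, alpha=0.01))  # fail to reject Ho
```

The same data can be significant at alpha = .05 and not at alpha = .01, which is why alpha has to be chosen in the PLAN step, before looking at the p-value.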

Answer 36

risk of false positive (reject null when actually true) | --risk should generally be small

Answer 37

FALSE | --Trying to prove the alternate hypothesis

Answer 38

realistic case: --gather data using SRS --sigma unknown --pop. distrib. approx. normal (single peak, no excessively long tails) 1. if sample does not have extreme outliers or skewness, can assume normality of pop. 2. as n INC., skewness and non-normality less worrisome; replace sigma with s: (x bar - Mu)/(s/radical n); s = radical(sum of (x - x bar)^2/(n - 1)); if sigma unknown, random, and normal/large sample size, then: sampling distrib. of (x bar - Mu)/(s/radical n) has a t-distrib. with n - 1 DF; t = (x bar - Mu)/(s/radical n); 95% conf. interval = x bar +/- t* s/radical n

Answer 39

false positive: rejecting null hypothesis when you shouldn't have; false negative: failing to reject null hypothesis when you should have rejected it; TYPE 1 error: reject Ho when it is true: false positive - pronounce guilty when they are innocent; TYPE 2 error: fail to reject Ho when it is false: false negative - pronounce innocent when guilty; type 1 is more serious - putting an innocent person behind bars is WORSE than letting a guilty person roam free (type 2)

Answer 40

alpha = level of significance --probability of type 1 error --probability reject Ho when it is true; BETA --probability of type 2 error --probability fail to reject Ho when it is false ``` POWER = probability (reject Ho when it is false) = 1 - beta --power = .99 = 99% prob. of rejecting null hypothesis when it is actually false (want to be high!!) ``` LEARN THE GRAPHS!!!

Answer 41

as N INC. the sampling distribution gets tighter and narrower; larger sample size always INC. power ``` alpha = prob. type 1 error; beta = prob. type 2 error ``` power = prob. of rejecting Ho when it is false (1 - beta); effect size = diff. btw Ha and Ho (EX. mu = 28, x bar = 24, effect size = 4)

Answer 42

10 is harder - more data = easier

Answer 43

Type 1 error: prob. = alpha; reject Ho when Ho is TRUE. Type 2 error: prob. = beta; fail to reject Ho when Ho is FALSE.

Answer 44

1. effect size - size of diff. btw Ho and Ha 2. variability in measurements (sigma = pop. st. dev.; inc. sigma = dec. power, dec. sigma = inc. power) 3. chosen significance level (alpha) 4. sample size (n INC = power INC, bc dec. st. dev. of x bar)
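
A rough sketch of how each of the four factors moves power. To keep it computable with the standard library it assumes a one-sided z test (sigma known, Ha: mu < Mo), which is a simplification of the t tests in these cards. Mo = 28 and a true mean of 24 echo the effect size example; sigma = 10 and the sample sizes are hypothetical.

```python
import math
from statistics import NormalDist

def power_lower_tail(mu0, mu_a, sigma, n, alpha=0.05):
    """Power of a one-sided z test of Ho: mu = mu0 vs Ha: mu < mu0 when the true mean is mu_a."""
    std_normal = NormalDist()
    se = sigma / math.sqrt(n)
    # Reject Ho when x bar falls below this cutoff.
    cutoff = mu0 + std_normal.inv_cdf(alpha) * se
    # Probability that x bar lands below the cutoff if the true mean is mu_a.
    return std_normal.cdf((cutoff - mu_a) / se)

base = power_lower_tail(mu0=28, mu_a=24, sigma=10, n=25)
bigger_effect = power_lower_tail(28, 20, 10, n=25)          # 1. larger effect size
less_noise = power_lower_tail(28, 24, 5, n=25)              # 2. smaller sigma
larger_alpha = power_lower_tail(28, 24, 10, n=25, alpha=0.10)  # 3. larger alpha
more_data = power_lower_tail(28, 24, 10, n=100)             # 4. larger n
print(base, bigger_effect, less_noise, larger_alpha, more_data)
```

Each of the four changes raises power above the baseline; beta in each case is 1 minus the printed value.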

Answer 45

t = (x bar - Mo)/(s/radical n); significance depends on: --x bar minus Mo: size of observed effect (numerator of test stat.), measures how far the sample mean deviates from the hypothesized Mo; the LARGER the observed effect, the smaller the p-val. (stronger evidence against Ho) --size of sample n: s/radical n measures how much random variation we expect; larger sample size = smaller p-val. (for the same observed effect) --sample size may be too small to detect significance --sample size may be so large that even tiny effects are significant
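
A sketch showing the same observed effect giving a more extreme t as n grows. Mo = 28 and x bar = 24 come from the effect size card; s = 10 and the two sample sizes are hypothetical.

```python
import math

def t_statistic(x_bar, mu0, s, n):
    """One-sample t statistic: (x bar - Mo) / (s / sqrt(n))."""
    return (x_bar - mu0) / (s / math.sqrt(n))

t_small_n = t_statistic(x_bar=24, mu0=28, s=10, n=25)
t_large_n = t_statistic(x_bar=24, mu0=28, s=10, n=100)
print(t_small_n, t_large_n)  # -2.0 -4.0
```

The numerator (observed effect of -4) is identical in both calls; only the denominator s/radical n changes.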

Answer 46

results are declared STATISTICALLY SIGNIFICANT when P-val. <= alpha; results are practically important when observed effect (numerator of test stat.) is large or imp. enough to matter; practical importance is not the same as statistical significance --practical imp. determined by common sense; large samples - unimportant diff. can be statistically significant; small samples - important diff. may not be statistically significant --always ask whether stat. significant effect is "large enough to matter"

Answer 47

declare statistically significant if P-val. <= alpha --consider practical imp. when observed effect large - practical value --observed effect: numerator of test stat. (x bar - Mo); 1. check for STAT. SIGNIFICANCE 2. check PRACTICAL IMP.

Answer 48

2 * p-val. of one sided test - 2 sided req. stronger evidence than one-sided
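
A sketch of the doubling rule. It uses the standard normal curve (z statistic) as a stand-in, because Python's standard library has no t distribution; the same doubling applies to t-based p-values. Function names are illustrative.

```python
from statistics import NormalDist

def one_sided_p(z):
    """Area in one tail beyond |z| under the st. normal curve."""
    return 1 - NormalDist().cdf(abs(z))

def two_sided_p(z):
    """Two-sided p-value: both tails, so twice the one-sided area."""
    return 2 * one_sided_p(z)

p_one = one_sided_p(1.96)
p_two = two_sided_p(1.96)
print(round(p_one, 3), round(p_two, 3))  # 0.025 0.05
```

With the same test stat, the two-sided p-value is twice as large, so it is harder to get under alpha.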

Answer 49

1. conduct test of significance at alpha = .05 --Ho: mean egg weight = 50 g --Ha: mean egg weight does not = 50 g --find t = (x bar - Mo)/(s/radical n), then look at DF for that t and bracket the p-val. in an interval --compare p-val. with alpha and draw conclusions 2. inspect interval --construct 95% CI for Mu: x bar +/- t* s/radical n --see if interval includes or excludes value 50 --if CI includes 50: data consistent with Ho (fail to reject) --if CI excludes 50: data provides evidence to reject Ho
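
A sketch of why the two approaches agree for a two-sided test. The egg data here are HYPOTHETICAL (n = 25, x bar = 51.2 or 50.4, s = 2.5), since the card gives no numbers; t* = 2.064 is the table value for 95% confidence with DF = 24. Comparing |t| with t* is equivalent to comparing the two-sided p-val. with alpha = .05.

```python
import math

def t_test_and_interval(x_bar, s, n, mu0, t_star):
    """Return t, the CI, and the reject decision from each approach."""
    se = s / math.sqrt(n)
    t = (x_bar - mu0) / se
    low, high = x_bar - t_star * se, x_bar + t_star * se
    reject_by_test = abs(t) > t_star               # same as p-val. <= alpha (two-sided)
    reject_by_interval = not (low <= mu0 <= high)  # CI excludes Mo
    return t, (low, high), reject_by_test, reject_by_interval

# Hypothetical sample far from 50 g: both approaches reject Ho.
t1, ci1, test1, interval1 = t_test_and_interval(51.2, 2.5, 25, mu0=50, t_star=2.064)
# Hypothetical sample close to 50 g: both approaches fail to reject Ho.
t2, ci2, test2, interval2 = t_test_and_interval(50.4, 2.5, 25, mu0=50, t_star=2.064)
print(test1, interval1, test2, interval2)
```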

Answer 50

FALSE --A p-value is a conditional probability—given the null hypothesis is true, it's the probability of getting a test statistic as extreme or more extreme than the calculated test statistic

Answer 51

How likely is it that, in a sample of 250, we will find that the mean number of hours per week full-time corporate employees work is as high as 47 if the true mean is 40?

Answer 52

Assuming the null hypothesis is true, there is a 0.0367 probability of obtaining a sample statistic as extreme or more extreme than what we calculated. Reject the null hypothesis. The true mean hemoglobin level of all children in Jordan is less than 12 g/dl.

Answer 53

Fail to reject the null hypothesis. There is insufficient evidence to conclude that the mean hemoglobin level of all children in Jordan is less than 12 g/dl. NOT - Fail to reject the null hypothesis. The mean hemoglobin level of all children in Jordan is equal to 12 g/dl - can't accept Ho!! just don't have enough evidence for Ha

Answer 54

Increase sample size.

(81 cards)