lecture 8 Flashcards

1
Q

describe empirical rule - gen

A

distributional properties in pop or sample
based on measures of central tendency and variability
tells us how much of data we will observe in 1,2 or 3 standard deviations away from mean
*for perfectly symmetric mound shaped distributions, concentrated around mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

describe empirical rule - specifics

A

roughly 68% of the obs will lie in the range = mean +- s.d.
roughly 95% of the obs will lie in the range = mean +- (2 x s.d.)
roughly 99.7% of the obs will lie in the range = mean +- (3 x s.d.)
“how much data in region”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

describe empirical rule - graphs

A

only theoretical
area under curve - just add up heights then divide by sample size = give relative freq
Riemann sum = histogram approximation
sample based calculation if replace pop quantities with sample quantities = get same result

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

describe empirical rule - body temp ex

A

gives bounds = 1,2 or 3 s.d. away
Empirical = theoretical
actual = pretty close but not exact
as long as graph is symmetric (no skew) = result will hold

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

describe empirical rule - heights ex where = mean (xbar) = 72 inches and s = 3.5 inches
where 68% of men in sample will have height in interval and same for 95%

A

(72-3.5, 72+3.5) = (68.5, 75.5 inches) ~ 68%
(72- 2 x 3.5, 72+ 2 x 3.5) = (65, 79 inches) ~95%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

describe empirical rule - statistics grades ex - want to find out interval that captures ~95% of observed data ??

A

(xbar-2s, xbar+2s) is the interval
approximating sample content in particular regions of observation range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

describe empirical rule - mathematics grades ex

A

if assume symmetric and calculate for ~95%, interval = 56.3, 110.7)
above 100
limitation = symmetric construction can lead to interval that doesnt overlap with measurement range
symmetric but mound in upper range
empirical rule NOT good to use here
can adjust interval to (56.3, 100)
not possible in context of experiment, outliers at lower end inflate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

describe empirical rule - exam grades ex

A

graph shows long left handed tail =negative skew
also upper range above measurement range
Empirical rule does not work well
still roughly right proportion but range not sensible

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

does the empirical rule always work

A

NUH UHNNNN
for skewed data = no
happens often with range restricted observations
Empirical rule usually pretty robust but can be broken
interval will not contain amount of data it says it will

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is chebyshev’s rule - gen

A

gives aprox bound
at least certain % can fall within interval
empirical rule = symmetric
but this is for any distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

describechebyshev’s rule - formula

A

for any distribution any number k>1: at least (1-(1/k^2)) x 100% of observations will fall into interval (xbar-ks, xbar+ks)
REGARDLESS of shape of histogram
shows relationship between mean, variance and distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

describe chebyshev’s rule - formula for specific k’s

A

k=2, % of obs falling within 2 s.d. of mean is AT LEAST = (1-1/2^2) x100% = 75%
k=3, % of obs falling within 2 s.d. of mean is AT LEAST = (1-1/3^2) x100% = 75%
Result applies to samples and populations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

describe chebyshev’s rule - math grades ex

A

couldnt rely on empirical rule
since distribution skewered
gave an interval that didn’t work
still gives same interval for chebyshev’s rule bur now = AT LEAST 75% instead of 95% from empirical rule

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

describe chebyshev’s rule - body temp ex

A

can apply chebyshevs and say that at least 75% of temps are in interval (xbar -2s, xbar+2s)
but in this case empirical rule does apply so
Roughly 95% of obs is a better approx than at least 75% in this case
empirical rule is more precise

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

describe chebyshev’s rule - conclusions

A

if data is mound shaped and approx symmetrical = better to use empirical rule = close approximation to the actual percentage
otherwise = use chebyshevs
can approximate bin content in any part of range just using sample mean and variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what is a z score

A

wish to report how extreme a value is in dataset
= how far obs is from center on some standard scale
can use z score to answer question
how sample mean and variance used to summarize data
measurement-mean/s.d = measures extremeness

17
Q

describe z score formula - sample

A

measure how extreme data point is compared to sample mean (measure of central tendency) but standardizing by dividing by s.d.
sample mean = xbar and sample s.d = s so z score
z score = x-xbar/s

18
Q

describe z score formula - pop

A

z score = x-μ/σ

19
Q

describe z score interpretation

A

how many standard devs away from mean value x is
measure of extremeness for x on scale of standard dev