lecture 7 Flashcards

1
Q

what must we be careful with with sample variance

A

rounding
if round mean = can get a negative variance which is impossible
so be careful

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what do we need to describe a data set

A

provide
numerical measures of center = mean, median, mode
Numerical measures of spread = range, variance
we must describe distribution of values or obs in sample
can be symmetric around a central point but not always

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

describe symmetric distributions

A

Sample have perfectly symmetric distribution if its histogram has a symmetric shape around some x value
mean = median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

how often does sample have perfectly symmetrical distribution

A

RARE
in some cases approximation pretty good
so mean ~ median
around the same
Unlikely they are numerically exactly the same - more possible for a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

how can data be skewed

A

left or right skew
depends on where tail is = away from central, where freq of obs low

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

describe right skew

A

positively right skewed data = histogram shows long tail on right
median < mean
mean moves to right
right tail dominating
not outliers tho, since like slowly drags out tail

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

describe left skew

A

negatively left skewed data = histogram shows long tail on left
median>mean
mean pulled to left bc directly influenced by data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

can we compete sample mean, median and variance if we only see histogram

A

nooo
data grouped in intervals
by looking at height we can see how many obs in interval but do not know exact location within each - hard to make precise conclusions only rough calculations
only approx
also cannot draw inferences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

from samples to populations

A

data representative of some underlying population
real goal is to understand characteristics of population based on sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

corresponding population quantities - things we measure

A

sample mean = Xbar –> pop mean = μ
sample variance = s^2 –> pop variance = σ^2
Sample standard dev = s –> pop stand dev (s.d.) = σ

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what does sample have to be

A

representative of population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

how to think about population - interpretation

A

as sample with extremely large sample size
keep collecting data until learnt everything about way data generated
Very large, infinite sample size= we have learnt everything we possibly could learn
Ultimately - histogram where bins gets smaller and smaller until it becomes a nice curve

How well did you know this?
1
Not at all
2
3
4
5
Perfectly