week 4 -- SD as a ruler & z-scores Flashcards

1
Q

Mean (equation)

A

mean y = sum of y / n

ȳ = sigma y / n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

descriptive vs. inferential

A

descriptive comes from our sample
inferential are statements about the population
–>
to make inferences about a population, I have to construct a model of that population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Statistic

A

item of numerical info about the SAMPLE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Paramater

A

item of numerical info about the MODEL (i.e., the POPULATION)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Estimator

A

a statistic used to estimate a parameter (e.g., sample mean)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Error

A

NON-SYSTEMATIC difference btw estimator and parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Bias

A

SYSTEMATIC difference btw estimator and parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Standard deviation – a measure to quantify spread of a sample (or population)

Allows us to answer: “How remarkable is a single observed value”?

algebraically = square root of variance
(square root of Σ (y- ȳ)2 / n
or with Bessel’s correction: Σ (y- ȳ)2 / n - 1

A

shows how close a data point is to the mean of the sample – BUT observations in a sample are always closer to their own mean than to the population mean. SO uncorrected SD is a biased estimator (OK as purely descriptive statistic)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the trick for comparing performance btw very different-looking values (e.g., meters run vs. time ran)?

A

Standard deviation!
(use as a “ruler” to measure distance from the mean)

expressing distance with SD “standardizes” the performances

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

z-score 1

allows us to compare apples and oranges (eliminates units)

A

letter z denotes values that have been standardized!!
(with mean & SD)

z = y - ȳ / s

z-score = performance - mean performance / standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

z-score 2

Comparsion shows us which score is more extraordinary

A

z-scores have NO UNITS
they tell us how far the data is from the mean
2 = 2 SD above the mean
-1.5 = 1.5 SD below the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

shifting data

plus or minus

A

Only measures of position change (center, min, max)

Neither shape nor spread changes (range, IQR, SD)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

rescaling data

multiply or divide

A

all measures of position (mean, median) and spread change

shape remains constant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

standardizing into z-scores shifts data by the mean and rescales them by the standard deviation

A
Shape stays constant
center changes (mean = 0)
spread changes (SD = 1)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

A statistical model is always wrong. Explain.

A

it is “wrong” in the sense that it doesn’t match reality exactly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Normal model

A

a way to show how extreme a z-score is

17
Q

N (μ, σ)

(Normal model

A

mew μ, sigma σ
μ = mean of a Normal model
σ = standard deviation of a Normal model

18
Q

Greek letters

A

NOT numerical summaries of data – they are part of the model, parameters

19
Q

Latin letters

A

summaries of data, statistics

20
Q

standardized data for model

A

z = y - μ / σ (for parameters)

cf.
z = y - ȳ / s (for statistics)

21
Q

68 - 95 - 99,7 rule

A

In a Normal model:
68% of values fall within 1 SD of the mean
95% of values fall within 2 SD of the mean
99,7% of values fall within 3 SD of the mean