week 4 -- SD as a ruler & z-scores Flashcards

Question 1

Q

Mean (equation)

Answer

A

mean y = sum of y / n

ȳ = sigma y / n

Question 2

Q

descriptive vs. inferential

Answer

A

descriptive comes from our sample
inferential are statements about the population
–>
to make inferences about a population, I have to construct a model of that population

Question 3

Q

Statistic

Answer

A

item of numerical info about the SAMPLE

Question 4

Q

Paramater

Answer

A

item of numerical info about the MODEL (i.e., the POPULATION)

Question 5

Q

Estimator

Answer

A

a statistic used to estimate a parameter (e.g., sample mean)

Question 6

Q

Error

Answer

A

NON-SYSTEMATIC difference btw estimator and parameter

Question 7

Q

Bias

Answer

A

SYSTEMATIC difference btw estimator and parameter

Question 8

Q

Standard deviation – a measure to quantify spread of a sample (or population)

Allows us to answer: “How remarkable is a single observed value”?

algebraically = square root of variance
(square root of Σ (y- ȳ)2 / n
or with Bessel’s correction: Σ (y- ȳ)2 / n - 1

Answer

A

shows how close a data point is to the mean of the sample – BUT observations in a sample are always closer to their own mean than to the population mean. SO uncorrected SD is a biased estimator (OK as purely descriptive statistic)

Question 9

Q

What is the trick for comparing performance btw very different-looking values (e.g., meters run vs. time ran)?

Answer

A

Standard deviation!
(use as a “ruler” to measure distance from the mean)

expressing distance with SD “standardizes” the performances

Question 10

Q

z-score 1

allows us to compare apples and oranges (eliminates units)

Answer

A

letter z denotes values that have been standardized!!
(with mean & SD)

z = y - ȳ / s

z-score = performance - mean performance / standard deviation

Question 11

Q

z-score 2

Comparsion shows us which score is more extraordinary

Answer

A

z-scores have NO UNITS
they tell us how far the data is from the mean
2 = 2 SD above the mean
-1.5 = 1.5 SD below the mean

Question 12

Q

shifting data

plus or minus

Answer

A

Only measures of position change (center, min, max)

Neither shape nor spread changes (range, IQR, SD)

Question 13

Q

rescaling data

multiply or divide

Answer

A

all measures of position (mean, median) and spread change

shape remains constant

Question 14

Q

standardizing into z-scores shifts data by the mean and rescales them by the standard deviation

Answer

A

Shape stays constant
center changes (mean = 0)
spread changes (SD = 1)

Question 15

Q

A statistical model is always wrong. Explain.

Answer

A

it is “wrong” in the sense that it doesn’t match reality exactly

Question 16

Q

Normal model

Answer

Study These Flashcards

A

a way to show how extreme a z-score is

Question 17

Q

N (μ, σ)

(Normal model

Answer

Study These Flashcards

A

mew μ, sigma σ
μ = mean of a Normal model
σ = standard deviation of a Normal model

Question 18

Q

Greek letters

Answer

Study These Flashcards

A

NOT numerical summaries of data – they are part of the model, parameters

Question 19

Q

Latin letters

Answer

Study These Flashcards

A

summaries of data, statistics

Question 20

Q

standardized data for model

Answer

Study These Flashcards

A

z = y - μ / σ (for parameters)

cf.
z = y - ȳ / s (for statistics)

Question 21

Q

68 - 95 - 99,7 rule

Answer

Study These Flashcards

A

In a Normal model:
68% of values fall within 1 SD of the mean
95% of values fall within 2 SD of the mean
99,7% of values fall within 3 SD of the mean

week 4 -- SD as a ruler & z-scores Flashcards

(21 cards)