Chapter 3 Flashcards by Courtney Sullivan

Descriptive Statistics

Summarize or describe relevant characteristics of data

How well did you know this?

Not at all

Perfectly

Mean

Average

How well did you know this?

Not at all

Perfectly

Σx

Sum of all data values

How well did you know this?

Not at all

Perfectly

Median

Middle value when all data are set in numerical order (count)

How well did you know this?

Not at all

Perfectly

Mode

Value that occurs with the greatest frequency in a data set

How well did you know this?

Not at all

Perfectly

Bimodal

Two modes

How well did you know this?

Not at all

Perfectly

Multimodal

More than 2 modes

How well did you know this?

Not at all

Perfectly

No mode

No data value is repeated

How well did you know this?

Not at all

Perfectly

Midrange

Largest value+minimum data value/2

How well did you know this?

Not at all

Perfectly

Rounding rule

Round to one place greater than the data

How well did you know this?

Not at all

Perfectly

Nominal level data

Doesn’t make sense to measure center numbers

ranks, zip codes, things that aren’t measurements

How well did you know this?

Not at all

Perfectly

Mean from a frequency distribution

Sum of all class midpoints/sum of frequencies
x̅=Σ(f*x)/Σf

How well did you know this?

Not at all

Perfectly

x̅

Mean

How well did you know this?

Not at all

Perfectly

Frequencies

How well did you know this?

Not at all

Perfectly

Class midpoint for frequency distribution, value in weighted mean, frequencies in s, magic in σ

How well did you know this?

Not at all

Perfectly

Weighted Mean

data contributes more significance than another: break it down

How well did you know this?

Not at all

Perfectly

Weighted mean formula

x̅=Σ(w*x)/Σw

How well did you know this?

Not at all

Perfectly

Skewed distribution

Data plot is more on one side than the other

How well did you know this?

Not at all

Perfectly

Skewed to the left

Study These Flashcards

Negatively skewed

Skewed to the right

Study These Flashcards

Positively skewed

Symmetric Data

Study These Flashcards

Zero skewness: mean, median, mode are same

Range

Study These Flashcards

Largest value-smallest value

Standard Deviation for a sample (s)

Study These Flashcards

Measure of variation of values about the mean.
s=√ nΣ(x^2)-(Σx)^2/n(n-1)
n=#values
x=frequencies

Standard Deviation for a population (σ)

Study These Flashcards

Measure of variation of values about the mean.
σ=√Σ(x-μ)^2/N
N=pop. size
μ=mean of pop
x=some magic # you pull out of your ass

Variance

s^2, σ^2 s^2 tends to be close to σ^2, making s^2 an unbiased predictor of σ^2. But difficult to understand caz different that original unit.

Rule of thumb

95% of data lies between 2 SD of the mean

Estimate Min & max data values

x̅-(2*s), x̅+(2*s)

Estimate SD

s=range/4

Empirical rule for bell shaped

68% of data falls within 1SD of mead, 95% 2SD, 99.7% 3SD

Chebyshev's Theorum

For any distribution the proportion of data values lying with K SD of the mean is always at least 1-1/K^2, where K is any positive #>1 K=SD from mean

sample SD

Pop. SD

s^2

Sample variance

σ^2

Pop. variance

<< SD

Values in data set are close together

>>SD

Values in data set have large variation

Z score (standardized value)

The # of SD that a given value x is above or below the mean. | z=x-x̅/s or z=x-μ/σ

Usual z scores

-2 < Z SCORE < or equal to 2 | Unusual data is called outlier data

Percentiles

Relative position of a data value compared to the data set in 100 groups. Data is _% BELOW a #

Percentile of x equation

x=100(#values below x)/(total# values)

Quartiles

Divides group into 4 parts Q1=P25, Q2=P50, Q3+P75

Interquartile range (IQR)

IQR=Q3-Q1

5 number summary box plot

Minimum, Maximum, median, Q1, Q3

Outliers

Data above Q3 or below Q1 by an amount > 1.5 IQR

Estimate range

min=x̅-(2*s), max=x̅+(2*s) OR range=s*4

Coefficient of variation

s/x̅*100, σ/μ*100 described sd relative to mean

Chapter 3 Flashcards

(46 cards)