Stats Flashcards
What is an index?
A change compared to a base value expressed in %.
Eg. Today $ / 2006 $ (x100) = 140
Equivalent to a 40% increase over 2006
Compare/contrast percentage vs. Index
Percentages compare only to its own point of reference. Index standardize the point of reference.
Absolute vs. Relative frequencies
Absolute is exact numbers. Relative is numbers within a range. Eg. 10-20
Line graph vs. Histogram
Line for relative frequencies
Bar centered on center point of range for frequency, touching other bars
EQ: mean (what, how to calc)
Sum / # of entries
Gives average value
E+, then x,y key
EQ: weighted arithmetic mean
Sum of (variable x weight) / sum of weights
Gives average
Mean vs median
Mean higher than median means bulk of data is lower and there is a right side tail (high potential outliers)
Mean lower than median means bulk of data is higher and there is a left side tail (low potential outliers)
When is mode useful?
Non-numeric data. Eg. Fav colours
EQ: Standard deviation
Looks like o with a tail
Value - mean, square it, sum them,divide by n, square root it
Gives how tightly clustered the values are. High = flat curve (loose)
Calc: sum values then ox, oy
Standard deviation intervals for a normal distribution
1 = 68.27%
2 = 95.45%
3 = 99.73%
EQ: Variance
Sum of (value - mean)^2 / n
OR std dev squared
Not very useful
EQ: coefficient of variation
SINGLE VARIABLE
= std dev / mean x 100
Shows how big the standard deviation is compared to the mean. Larger means more dispursed or more variable
BIVARIATE
=SEE / mean (dependent variable) x 100
tells us the size of the error as compared to the mean (%)
MULTIVARIATE
=same as bivariate
When is the median preferable to the mean?
- Open-ended frequency group
- Extreme values
EQ: Linear correlation coefficient (r)
Shows relationship between tow variables. Value from -1 to 1. Zero means no correlation. r > 0.8 strong, r < 0.4 weak.
BIVARIATE
Calc: x1 INPUT y1 [E+], [x^,r] ,SWAP
Cons: cannot describe non-linear relationships
Multivariate counterpart is coefficient of determination (R^2)
EQ: Sum of least squares (S)
BIVARIATE
(Actual - predicted)^2, then sum
Univariate would be variance before dividing by n