Re-expressing Data Flashcards
Re-expression
Do this by taking the logarithm, the square root, the reciprocal, or some other mathematical operation of all values of a variable.
Tukey’s Rule for Re-expression Graph
Tukey’s Rule if the data have a shape similar to that shown in the first quadrant (high x, high y):
If the data have a shape similar to that shown in the first quadrant (high x, high y), then the data analyst tries re-expressing by going up-ladder for X, Y or both.
Tukey’s Rule if the data have a shape similar to that shown in the second quadrant (low x, high y):
If the data have a shape similar to that shown in the second quadrant (low x, high y), then the data analyst tries re-expressing by going the down-ladder for X, and/or up-ladder for Y.
Tukey’s Rule if the data have a shape similar to that shown in the third quadrant (low x, low y):
If the data have a shape similar to that shown in the third quadrant (low x, low y), then the data analyst tries re-expressing by going down-ladder for X, Y or both.
Tukey’s Rule if the data have a shape similar to that shown in the fourth quadrant (high x, low y):
If the data have a shape similar to that shown in the fourth quadrant (high x, low y), then the data analyst tries re-expressing by going the up-ladder for X, and/or down-ladder for Y.
Exponential Re-expression
x, log (y)
This model is the “0” power in the ladder approach, useful for values that grow by percentage increases.
Logarithmic Re-expression
log (x) , y
Useful when data has wide range of x-values or scatterplot descending rapidly at the left but leveling off toward the right
Power Re-expression
log (x) , log (y)
The Goldilocks Model:
When one of the ladder’s powers is too big and the next is too small, this one may be just right.