Describing Distributions Flashcards
If the graph of your data is skewed, what center and measure of spread should you use?
Center: Median
Measure of Spread: IQR
How to interpret R² (Coefficient of Determination)
Coefficient of Determination: (R² as a percent) of the variation in (Y) can be explained by the linear relationship with (X)
Example Problem:
Given that R² is 0.99, Y = height, and X = age, describe what it means in the context of the situation.
99% of the variation in height (Y) can be explained by the linear relationship with age (X).
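As a rough illustration of where such a percentage comes from, the sketch below computes the correlation r and then squares it. The age and height numbers are made up for illustration (they are not from the cards), and the function names are hypothetical.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Correlation coefficient r between two equal-length lists of numbers."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of products and squares of deviations from the means
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

def r_squared_percent(xs, ys):
    """R² expressed as a percent (for a straight-line fit, R² = r²)."""
    return 100 * pearson_r(xs, ys) ** 2

# Hypothetical data: age in years (X) and height in cm (Y)
ages = [2, 4, 6, 8, 10]
heights = [86, 102, 115, 128, 138]

percent = r_squared_percent(ages, heights)
print(f"About {percent:.0f}% of the variation in height is explained by age.")
```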

Pie Chart

Stem Plot
What type of graph would you make with this information?

Ogive

Dot Plot
What is the center?
The middle of the distribution, first located by finding the highest peak. It is measured by mean, median, or mode.
What is IQR?
What is its equation?
When do you find the IQR for spread?
- Interquartile Range
- Q3-Q1=IQR
- You use it when the shape of the graph is skewed
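The IQR calculation above can be sketched as follows. This assumes the "median of each half" way of finding Q1 and Q3 that is common in introductory courses; calculators and software sometimes use slightly different quartile rules. The function names and sample data are hypothetical.

```python
from statistics import median

def quartiles(data):
    """Return (Q1, median, Q3) using the median-of-halves method.

    When the count is odd, the middle value is left out of both halves.
    """
    values = sorted(data)
    if len(values) < 2:
        raise ValueError("need at least two values")
    half = len(values) // 2
    lower = values[:half]    # values below the median position
    upper = values[-half:]   # values above the median position
    return median(lower), median(values), median(upper)

def iqr(data):
    """Interquartile range: Q3 - Q1."""
    q1, _, q3 = quartiles(data)
    return q3 - q1

# Hypothetical data set
sample = [1, 3, 5, 7, 9, 11, 13]
print(quartiles(sample))
print(iqr(sample))
```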

Histogram

Bar Graph
What is mean; its equation?
The average; the sum of the data values divided by the number of values.

What is a median?
The middle value when the numbers are in numerical order (with an even count, the average of the two middle values).
What is the mode and when is it used to describe the center of a graph?
- The number that occurs the most
- When the graph’s shape is bimodal
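For a quick check of the three measures of center on the cards above, here is a small sketch using Python's standard `statistics` module. The data set is hypothetical and was chosen to have two modes; `summarize_center` is an illustrative helper name, not something from the cards.

```python
from statistics import mean, median, multimode

def summarize_center(data):
    """Return the mean, median, and list of modes for a data set."""
    return {
        "mean": mean(data),        # sum of the values / number of values
        "median": median(data),    # middle value of the sorted data
        "modes": multimode(data),  # every value tied for most frequent
    }

# Hypothetical bimodal data: 2 and 8 each occur twice
scores = [1, 2, 2, 3, 7, 8, 8, 9]
print(summarize_center(scores))
```

With two modes far apart, the mean and median both land in a gap where little data sits, which is why the mode is the more useful description for a bimodal shape.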
What is a Population?
The entire group of individuals or outcomes that a particular study wants to describe.
How do you find the outliers in a set of data?
Low outliers: any value below Q1-(1.5*IQR)
High outliers: any value above Q3+(1.5*IQR)
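The 1.5*IQR rule can be sketched like this. It assumes the median-of-halves method for Q1 and Q3 (other quartile rules can give slightly different fences), and the function names and data are hypothetical.

```python
from statistics import median

def outlier_fences(data):
    """Return (low_fence, high_fence) from the 1.5*IQR rule."""
    values = sorted(data)
    if len(values) < 2:
        raise ValueError("need at least two values")
    half = len(values) // 2
    q1 = median(values[:half])    # median of the lower half
    q3 = median(values[-half:])   # median of the upper half
    spread = q3 - q1              # the IQR
    return q1 - 1.5 * spread, q3 + 1.5 * spread

def find_outliers(data):
    """Return the values that fall outside the fences."""
    low, high = outlier_fences(data)
    return [x for x in data if x < low or x > high]

# Hypothetical data with one unusually large value
sample = [1, 3, 5, 7, 9, 11, 13, 40]
print(outlier_fences(sample))
print(find_outliers(sample))
```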
True or False. When looking at the United States, the city of New York is a population.
False
Practice Problem
A study found correlation r=0.61 between the sex of a worker and his or her income. You conclude that
A. Women earn more than men on the average.
B. Women earn less than men on average.
C. An arithmetic mistake was made; this is not a possible value of r.
D. This is nonsense because r makes no sense here.
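D. This is nonsense because r makes no sense here. Sex is a categorical variable, and r only measures the linear relationship between two quantitative variables.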
Practice Problem
Which of the following would not be a correct interpretation of a correlation of r=-.30?
A. The variables are inversely related.
B. The coefficient of determination is 0.09.
C. 30% of the variation between the variables is linear.
D. There exists a weak relationship between the variables.
E. All of the above statements are correct.
C. 30% of the variation between the variables is linear. (r is not a percent; r² = 0.09, so 9% of the variation is explained by the linear relationship.)
What is the value of correlation for operating cost per hour and number of passenger seats in the plane? Interpret the correlation.
r=0.75
There is a strong positive linear relationship between the operating cost per hour and the number of passenger seats on the plane.
Practice Problem
A copy machine dealer has data on the number x of copy machines at each of 89 customer locations and the number y of service calls in a month at each location. It was given that r=0.86. What percent of the variation in number of service calls is explained by the linear relation between number of service calls and number of machines?
A. 86%
B. 93%
C. 74%
D. None of these
E. Can’t tell from the information given
C. 74% The question is asking for r², and since r=0.86, you square 0.86 to get 0.7396, or about 74%.
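The arithmetic in this answer is simple enough to check in a couple of lines. The helper name `percent_explained` is hypothetical; it just squares r and converts to a rounded percent.

```python
def percent_explained(r):
    """Percent of variation explained by the linear relation: r squared, as a percent."""
    return round(r ** 2 * 100)

print(percent_explained(0.86))   # copy machine problem
print(percent_explained(-0.30))  # the sign of r does not matter once it is squared
```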
What is a qualitative variable?
A variable that describes non-numerical (categorical) values.
What are the two types of qualitative graphs covered in class?
Pie Chart
Bar Graph
What is a Quantitative variable?
A numerical variable that represents a measurable quantity.

