Describing Distributions Flashcards

1
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

If the graph of your data is skewed, what center and measure of spread should you use?

A

Center: Median

Measure of Spread: IQR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How to interpret R2 (Coefficient of Determination)

A

Coefficient of Determination : (R2 as a percent) of the change in (Y) can be explained by the change in (X)

Example Problem:

Given that R2 is .99, Y = height, and X= age, describe what it means in the context of the situation.

-99% of the change in height (Y) can be explained by the change in age (X)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q
A

Pie Chart

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q
A

Stem Plot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What type of graph would you make with this information?

A

Ogive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q
A

Dot Plot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the center?

A

The middle of the distribution, first located by finding the highest peak. It is measured by mean, median, or mode.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is IQR?

What is its equation?

When do you find the IQR for spread?

A
  • Inner Quartile Range
  • Q3-Q1=IQR
  • You use it when the shape of the graph is skewed
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q
A

Histogram

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q
A

Bar Graph

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is mean; its equation?

A

The average; the sum of the data divided by the number of numbers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a median?

A

The middle # when the numbers are in numerical order.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the mode and when is it used to describe the center of a graph?

A
  • The number that occurs the most
  • When the graph’s shape is bimodal
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a Population?

A

Used to measure all possible outcomes in a particular study.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How do you find the outliers in a set of data?

A

Low outliers: Q1-(1.5*IQR)

High outliers: Q3+(1.5*IQR)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

True or False. When looking at the United States, the city of New York is a population.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Practice Problem A study found correlation r=0.61 between the sex of a worker and his or her income. You conclude that

A. Women earn more than men on the average.

B. Women earn less than men on average.

C. An arithmetic mistake was made; this is not a possible value of r.

D. This is nonsense because r makes no sense here.

A

D. This is nonsense because r makes no sense here.

13
Q

Practice Problem

Which of the folowing would not be a correct interpretation of a correlation of r=-.30?

A. The variances are inversely related.

B. The coefficient of determination is 0.09.

C. 30% of the varitaion between the variables is linear.

D. There exists a weak relationship between the variables.

E. All of the above statements are correct.

A

C. 30% of the varitaion between the variables is linear.

13
Q

What is the value of correlation for operating cost per hour and number of passenger seats in the plane? Interpret the correlation.

A

r=0.75

There is a strong positive linear relationship between the operating cost per hour and the number of passenger seats on the plane.

14
Q

Practice Problem A copy machine dealer has data on the number x of copy machines at each of 89 customer locations and the number of y service calls in a month at each location. It was given that r=0.86. What percent of the variation in number of service calls is explained by the linear relation between number of service calls and number of machines?

A. 86%

B. 93%

C. 74%

D. None of these

E. Can’t tell from the information given

A

C. 74% The question is asking for r2 and since r=0.86, you would square 0.86 and get 0.74.

15
Q

What is a qualitative variable?

A

A variable that describes non numerical values.

16
Q

What are the two types of qualitative graphs covered in class?

A

Pie Chart

Bar Graph

17
Q

What is a Quantitative variable?

A

A numerical variable that represents a measurable quantity.

18
Q

What are the four types of quantitative graphs covered in class?

A

Stem Plot

Histogram

Dot Plot

Ogive

20
Q

which of the following is Quantitative?

a. age
b. time
c. gender
d. number of girls in a class
e. number of people in a class

A

a and e

22
Q

What is range?

What is its equation?

When do you find range?

A
  • The maximum value minus the minimum value in the data set
  • Max- Min
  • You find the range for boxplots and its five-number summary
23
Q

What is the range for correlation coefficient?

A

-1 to 1

25
Q

How do you find the residual?

A

y-ŷ

26
Q

What is a Sample?

A

A portion of a population

27
Q

True or False. A class in a school is a sample.

A

True

28
Q

What are the possible measures of spread?

A

Range

IQR

Variance

Standard Deviation

29
Q

What is standard deviation?

What is its equation?

When do you use standard deviation to find spread?

A
  • The average distance from the mean
  • You use it when the shape of the graph is symmetric
31
Q

What is a two way table?

A

IDK, There is a row, column. Add all the number of column and row will give the total amount of the study. 

32
Q

What is variance?

What is its equation?

A
  • The square of standard deviation
34
Q

What are the things required to interpret correlation?

A

Form, strength, and direction

36
Q

What is a Risidual 

A

The vertical distance from a expected point to the regression line.

37
Q

What is the equation to find the slope (b)?

A

b=r(Sy/Sx)

39
Q

What is the general equation for a least square regression line

A

ŷ=a+bx

40
Q

What are the symbols for Correla2tion and Coefficient of Determination?

A

R = Correlation

R2 = Coefficient of Determination

41
Q

Which type of quantitative graph organizes your one variable data chronologically so that, for example, all of the data in the 20-29 region would be grouped together with a 2 on one side of a line and a list of the changing ones place (2 2 3 4 4 7)?

A

Stem Plot

42
Q

Which type of 1-variable quantitative graph does not necessarily need a y-axis label and stacks a dot over it’s x value for each count?

A

Dot Plot

43
Q

Which quantitative graph divides its data into Bin and Counts?

A

Histogram

44
Q

What quantitative graph divides its x-values into groups and then adds the previous y-values quantity to its own?

A

Ogive