Lesson 6 Flashcards

Question 1

Q

When talking about a quantitative
variable, there are (at least) three
important aspects to discuss:

Answer

A

The center, the variability / spread, and the shape

Question 2

Q

When we talk about the center, we’re talking
about an “average” or “typical” value in the
dataset. This could refer to any of the following:

Answer

A

Mean (most common), median, mode (rare cases)

Question 3

Q

If we’re talking about a sample mean, we use

Answer

A

x̄ (called “x bar”)

Question 4

Q

If we’re talking about a population mean, we
use

Answer

A

µ (the Greek letter mu)

Question 5

Q

In each case below, would it make more sense to use the mean, median, or mode to
describe the center of the data?
1. Income for residents of Elon, NC
2. Heights of newborn babies
3. Number of siblings
4. Exam scores in a class where all students did well

Answer

A

Income for residents of Elon, NC
Median, since there would likely be outliers (a few very high incomes) and the data would not be symmetric.
Heights of newborn babies
Mean, since the data values would be roughly symmetric with few outliers.
Number of siblings
Mode (the median or mean could also work)
Exam scores in a class where all students did well
Either the mean or the median
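
To see why the choice matters, here is a minimal Python sketch using the standard `statistics` module. The income and sibling values are made up for illustration; they are not from the lesson.

```python
from statistics import mean, median, mode

# Hypothetical annual incomes in thousands of dollars.
# The last value is a deliberate outlier.
incomes = [32, 35, 38, 40, 41, 45, 400]

income_mean = mean(incomes)      # pulled upward by the outlier
income_median = median(incomes)  # the middle value, unaffected by how large 400 is

# Hypothetical sibling counts: only a few distinct values occur,
# so the most common value (the mode) is a natural summary.
siblings = [0, 1, 1, 1, 2, 2, 3]
sibling_mode = mode(siblings)

print("mean income:", round(income_mean, 1))
print("median income:", income_median)
print("most common sibling count:", sibling_mode)
```

With these made-up numbers the mean lands well above what most residents earn, while the median stays near the middle of the group.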

Question 6

Q

The simplest measure of spread / variability is the range. The range is
the biggest value minus the smallest value. In other words,

Answer

A

Range = maximum value – minimum value

Question 7

Q

An alternative way to measure the variability / spread is with the

Answer

A

Standard deviation. The standard deviation is like a “typical” distance that a value might be
away from the mean. Let’s think about this with our exam example.
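
As a rough illustration, the sketch below computes the range and the standard deviation for a small set of exam scores. The scores are hypothetical, and the cards do not say whether the sample or population formula is intended; `statistics.stdev` uses the sample version (dividing by n - 1), which is an assumption here.

```python
from statistics import mean, stdev

# Hypothetical exam scores (not from the lesson).
scores = [70, 75, 80, 85, 90]

# Range = maximum value - minimum value
data_range = max(scores) - min(scores)

# Sample standard deviation: roughly a "typical" distance from the mean.
center = mean(scores)
sample_sd = stdev(scores)

print("mean:", center)
print("range:", data_range)
print("standard deviation:", round(sample_sd, 2))
```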

Question 8

Q

Which will be more resistant to outliers? The range or the standard
deviation?

Answer

A

The standard deviation is more resistant to outliers than the range.

Question 9

Q

Do you think the measure of spread you chose will be impacted much
by outliers or not really? What is your reasoning?

Answer

A

The standard deviation is probably still impacted by outliers (you can test this with
the Mike example), because it uses all of our data values in the calculation,
but it will be less impacted than the range.

Question 10

Q

Percentiles are...

Answer

A

Percentiles are considered a measure of location. They tell
you where a data value lies within your data by describing
the percentage of the values that are below a specific
value.
E.g. Ben Wyatt’s SAT score was at the 98th percentile.

Question 11

Q

Traditionally, percentiles are

Answer

A

Whole numbers. If you are at the kth percentile, then k% of the
sample (or population) is below you.
E.g. Ben Wyatt’s SAT score was at the 98th percentile → 98% of
people had an SAT score below Ben’s score.
To calculate a percentile, we count the number of other
observations below a value, divide by the total number of
observations, then convert to a percentage.

Question 12

Q

During the 2019-2020 season, Elon’s women’s soccer team
scored 2.1 goals per game. This was the 31st best out of 335
NCAA Division 1 teams. Find their percentile rank.

Answer

A

335 – 31 = 304 teams below Elon.
(304 / 335) * 100 ≈ 90.7
Elon’s goals per game was at the 90th percentile
(Note: we will always round down for percentiles)
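
The counting procedure can be sketched in Python as follows. The function names and the list of scores are hypothetical, chosen only for illustration; the round-down step follows the note above.

```python
import math

def percentile_from_counts(below, total):
    """Percentile rank from a count of observations below a value, rounded down."""
    return math.floor(100 * below / total)

def percentile_rank(values, x):
    """Percentile rank of x: the percent of observations strictly below x."""
    below = sum(1 for v in values if v < x)
    return percentile_from_counts(below, len(values))

# The soccer example: 31st best of 335 teams leaves 335 - 31 = 304 teams below.
elon = percentile_from_counts(335 - 31, 335)

# A hypothetical list of scores: 7 of the 10 values are below 90.
scores = [55, 60, 65, 70, 75, 80, 85, 90, 95, 100]
rank_90 = percentile_rank(scores, 90)

print(elon)     # 90
print(rank_90)  # 70
```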

Question 13

Q

Quartiles are a...

Answer

A

specific type of percentile. They split our data into
quarters by representing the 25th, 50th, and 75th percentiles.
* First quartile (or Q1) has 25% of the data below it.
* Second quartile (or Q2) has 50% of the data below it.
* Third quartile (or Q3) has 75% of the data below it.
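
A quick way to find quartiles in Python is `statistics.quantiles` with `n=4`. This is only a sketch: the data are hypothetical, and different textbooks and software use slightly different quartile conventions, so hand calculations may not match the library's default ("exclusive") method exactly.

```python
from statistics import median, quantiles

# Hypothetical data set with 11 values (not from the lesson).
data = [2, 4, 4, 5, 7, 8, 9, 10, 12, 13, 15]

# n=4 asks for the three cut points that split the data into quarters.
q1, q2, q3 = quantiles(data, n=4)

print("Q1:", q1)  # 4.0
print("Q2:", q2)  # 8.0
print("Q3:", q3)  # 12.0
print("Q2 matches the middle value of the data:", q2 == median(data))
```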

Question 14

Q

What is another name for the second quartile?

Answer

A

The median

Question 15

Q

One other measure of spread/variability that you may
encounter is called the

Answer

A

interquartile range or IQR.

Question 16

Q

Rather than looking at the entire range (max – min) we look at the range between our
quartiles (Q3 – Q1).
Why won’t the IQR be impacted by outliers like the range was?

Answer

A

Any outliers will be below the first quartile or above the third quartile! This means the
IQR is more resistant to outliers than the range (and the standard deviation).
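
The sketch below compares the three measures of spread on a hypothetical data set, with and without one extreme value. The helper name `spread_summary` and the data are assumptions for illustration, and the quartiles come from `statistics.quantiles`, whose default convention may differ slightly from a hand calculation.

```python
from statistics import quantiles, stdev

def spread_summary(values):
    """Return the range, sample standard deviation, and IQR (Q3 - Q1)."""
    q1, _, q3 = quantiles(values, n=4)
    return {
        "range": max(values) - min(values),
        "sd": stdev(values),
        "iqr": q3 - q1,
    }

# Hypothetical data, then the same data with the largest value
# replaced by an extreme outlier.
base = [2, 4, 4, 5, 7, 8, 9, 10, 12, 13, 15]
with_outlier = base[:-1] + [150]

before = spread_summary(base)
after = spread_summary(with_outlier)

for name in ("range", "sd", "iqr"):
    print(name, round(before[name], 2), "->", round(after[name], 2))
```

In this example the outlier sits above Q3, so the range and standard deviation both grow while the IQR stays the same.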

Question 17

Q

How do you find the weighted average?

Answer

A

* Multiply each data value by its weight
* Add these together (first sum)
* Add your weights together (second sum)
* Divide the first sum by the second sum
As a formula: weighted average = Σ(value × weight) / Σ(weight)
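
The steps above can be sketched in Python. The function name, the grades, and the weights are hypothetical, used only to illustrate the calculation.

```python
def weighted_average(values, weights):
    """Sum of value * weight, divided by the sum of the weights."""
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    first_sum = sum(v * w for v, w in zip(values, weights))  # sum of value * weight
    second_sum = sum(weights)                                # sum of the weights
    if second_sum == 0:
        raise ValueError("weights must not sum to zero")
    return first_sum / second_sum

# Hypothetical course grades and their weights (as percentages of the final grade).
grades = [90, 80, 70]
weights = [50, 30, 20]

final_grade = weighted_average(grades, weights)
print(final_grade)  # 83.0
```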

Question 18

Q

Answer

A

Lesson 6 Flashcards

(18 cards)